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Historical Introduction 


Mathematics has a rich history, going back at least 5000 years. Very few subjects 
still make use of ideas that are as old as that, but in mathematics, important dis¬ 
coveries have lasting value. Most of the latest mathematical research makes use of 
theorems that were published last year, but it may also use results first discovered by 
Archimedes, or by some unknown Babylonian mathematician, astronomer, or priest. 
For example, ever since Archimedes proved (around 250 BC) that the volume of a 
sphere is what we would now write as |7Tr 3 , that discovery has been available to any 
mathematician who is aware of the result, and whose research involves spheres. Al¬ 
though there are revolutions in mathematics, they are usually changes of viewpoint or 
philosophy; earlier results do not change—although the hypotheses needed to prove 
them may. In fact, there is a word in mathematics for previous results that are later 
changed: they are called ‘mistakes’. 

The history of Galois theory is unusually interesting. It certainly goes back to 
1600 BC, where among the mud-brick buildings of exotic Babylon, some priest or 
mathematician worked out how to solve a quadratic equation, and they or their stu¬ 
dent inscribed it in cuneiform on a clay tablet. Some such tablets survive to this day, 
along with others ranging from tax accounts to observations of the motion of the 
planet Jupiter, Figure 1 (Left). 

Adding to this rich historical brew, the problems that Galois theory solves, pos¬ 
itively or negatively, have an intrinsic fascination—squaring the circle, duplicating 
the cube, trisecting the angle, constructing the regular 17-sided polygon, solving the 
quintic equation. If the hairs on your neck do not prickle at the very mention of these 
age-old puzzles, you need to have your mathematical sensitivities sharpened. 

If those were not enough: Galois himself was a colourful and tragic figure—a 
youthful genius, one of the thirty or so greatest mathematicians who have ever lived, 
but also a political revolutionary during one of the most turbulent periods in the 
history of France. At the age of 20 he was killed in a duel, ostensibly over a woman 
and quite possibly with a close friend, and his work was virtually lost to the world. 
Only some smart thinking by Joseph Liouville, probably encouraged by Galois’s 
brother Alfred, rescued it. Galois’s story is one of the most memorable among the 
lives of the great mathematicians, even when the more excessive exaggerations and 
myths are excised. 

Our tale therefore has two heroes: a mathematical one, the humble polynomial 
equation, and a human one, the tragic genius. We take them in turn. 
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FIGURE 1: Left. A Babylonian clay tablet recording the motion of lupiter. Right. A 
page from Pacioli’s Summa di Arithmetica. 


Polynomial Equations 

A Babylonian clay tablet from about 1600 BC poses arithmetical problems 
that reduce to the solution of quadratic equations (Midonick 1965 page 48). The 
tablet also provides firm evidence that the Babylonians possessed general meth¬ 
ods for solving quadratics, although they had no algebraic notation with which 
to express their solution. Babylonian notation for numbers was in base 60, so 
that (when transcribed into modern form) the symbols 7,4;3,11 denote the number 
7 x 60 2 + 4 x 60 + 3 x 60 _1 + 11 x 60~ 2 = 25440 In 1930 the historian of sci¬ 
ence Otto Neugebauer announced that some of the most ancient Babylonian problem 
tablets contained methods for solving quadratics. For instance, one tablet contains 
this problem: find the side of a square given that the area minus the side is 14,30. 
Bearing in mind that 14,30 = 870 in decimal notation, we can formulate this prob¬ 
lem as the quadratic equation 

x 2 — x = 870 


The Babylonian solution reads: 

Take half of 1, which is 0:30, and multiply 0:30 by 0:30, which is 
0:15. Add this to 14,30 to get 14,30:15. This is the square of 29:30. Now 
add 0;30 to 29:30. The result is 30, the side of the square. 
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Although this description applies to one specific equation, it is laid out so that similar 
reasoning can be applied in greater generality, and this was clearly the Babylonian 
scribe’s intention. The method is the familiar procedure of completing the square, 
which nowadays leads to the usual formula for the solution of a quadratic. See Joseph 
(2000) for more on Babylonian mathematics. 

The ancient Greeks in effect solved quadratics by geometric constructions, but 
there is no sign of an algebraic formulation until at least AD 100 (Bourbaki 1969 
page 92). The Greeks also possessed methods for solving cubic equations, which 
involved the points of intersection of conics. Again, algebraic solutions of the cubic 
were unknown, and in 1494 Luca Pacioli ended his Summa di Arithmetica (Figure 1, 
right) with the remark that (in his archaic notation) the solution of the equations 
x 3 + mx = n and x 3 -n = mx was as impossible at the existing state of knowledge as 
squaring the circle. 

This state of ignorance was soon to change as new knowledge from the Middle 
and Far East swept across Europe and the Christian Church’s stranglehold on in¬ 
tellectual innovation began to weaken. The Renaissance mathematicians at Bologna 
discovered that the solution of the cubic can be reduced to that of three basic types: 
x 3 + px = q.x 3 = px + q, and x 3 +q = px. They were forced to distinguish these 
cases because they did not recognise the existence of negative numbers. It is thought, 
on good authority (Bortolotti 1925), that Scipio del Ferro solved all three types; he 
certainly passed on his method for one type to a student, Antonio Fior. News of the 
solution leaked out, and others were encouraged to try their hand. Solutions for the 
cubic equation were rediscovered by Niccolo Fontana (nicknamed Tartaglia, ‘The 
Stammerer’; Figure 2, left) in 1535. 

One of the more charming customs of the period was the public mathemati¬ 
cal contest, in which mathematicians engaged in mental duels using computational 
expertise as their weapons. Mathematics was a kind of performance art. Fontana 
demonstrated his methods in a public competition with Fior, but refused to reveal the 
details. Finally he was persuaded to tell them to the physician Girolamo Cardano, 
having first sworn him to secrecy. Cardano, the ‘gambling scholar’, was a mixture 
of genius and rogue, and when his Ars Magna (Figure 2, right) appeared in 1545, it 
contained a complete discussion of Fontana’s solution. Although Cardano claimed 
motives of the highest order (see the modern translation of his The Book of My Life , 
1931), and fully acknowledged Fontana as the discoverer, Fontana was justifiably 
annoyed. In the ensuing wrangle, the history of the discovery became public knowl¬ 
edge. 

The Ars Magna also contained a method, due to Ludovico Ferrari, for solving 
the quartic equation by reducing it to a cubic. Ferrari was one of Cardano’s students, 
so presumably he had given permission for his work to be published... or perhaps a 
student’s permission was not needed. All the formulas discovered had one striking 
property, which can be illustrated by Fontana’s solution x 3 + px = q : 
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FIGURE 2: Left. Niccolo Fontana (Tartaglia), who discovered how to solve cubic 
equations. Right : Title page of Girolamo Cardano’s Ars Magna. 


This expression, usually called Cardano’s formula because he was the first to publish 
it, is built up from the coefficients p and q by repeated addition, subtraction, mul¬ 
tiplication, division, and—crucially—extraction of roots. Such expressions became 
known as radicals. 

Since all equations of degree < 4 were now solved by radicals, it was natural to 
ask how to solve the quintic equation by radicals. Ehrenfried Walter von Tschirn- 
haus claimed a solution in 1683, but Gottfried Wilhelm Leibniz correctly pointed 
out that it was fallacious. Leonhard Euler failed to solve the quintic, but found new 
methods for the quartic, as did Etienne Bezout in 1765. Joseph-Louis Lagrange took 
a major step forward in his magnum opus Reflexions sur la Resolution Algebrique 
des Equations of 1770-1771, when he unified the separate tricks used for the equa¬ 
tions of degree < 4. He showed that they all depend on finding functions of the roots 
of the equation that are unchanged by certain permutations of those roots, and he 
showed that this approach fails when it is tried on the quintic. That did not prove that 
the quintic is insoluble by radicals, because other methods might succeed where this 
particular one did not. But the failure of such a general method was, to say the least, 
suspicious. 

A realisation that the quintic might not be soluble by radicals was now dawning. 
In 1799 Paolo Ruffini published a two-volume book Teoria Generate delle Equazioni 
whose 516 pages constituted an attempt to prove the insolubility of the quintic. Tig- 
nol (1988) describes the history, saying that ‘Ruffini’s proof was received with scepti¬ 
cism in the mathematical community.’ The main stumbling-block seems to have been 
the length and complexity of the proof; at any rate, no coherent criticisms emerged. 
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In 1810 Ruffini had another go, submitting a long paper about quintics to the French 
Academy; the paper was rejected on the grounds that the referees could not spare the 
time to check it. In 1813 he published yet another version of his impossibility proof. 
The paper appeared in an obscure journal, with several gaps in the proof (Bour- 
baki 1969 page 103). The most significant omission was to assume that all radicals 
involved must be based on rational functions of the roots (see Section 8.7). Nonethe¬ 
less, Ruffini had made a big step forward, even though it was not appreciated at the 
time. 

As far as the mathematical community of the period was concerned, the question 
was finally settled by Niels Henrik Abel in 1824, who proved conclusively that the 
general quintic equation is insoluble by radicals. In particular he filled in the big gap 
in Ruffini’s work. But Abel’s proof was unnecessarily lengthy and contained a minor 
error, which, fortunately, did not invalidate the method. In 1879 Leopold Kronecker 
published a simple, rigorous proof that tidied up Abel’s ideas. 

The ‘general’ quintic is therefore insoluble by radicals, but special quintic equa¬ 
tions might still be soluble. Some are: see Section 1.4. Indeed, for all Abel’s methods 
could prove, every particular quintic equation might be soluble, with a special for¬ 
mula for each equation. So a new problem now arose: to decide whether any partic¬ 
ular equation can be solved by radicals. Abel was working on this question in 1829, 
just before he died of a lung condition that was probably tuberculosis. 

In 1832 a young Frenchman, Evariste Galois, was killed in a duel. He had for 
some time sought recognition for his mathematical theories, submitting three mem¬ 
oirs to the Academy of Sciences in Paris. They were all rejected, and his work ap¬ 
peared to be lost to the mathematical world. Then, on 4 July 1843, Liouville ad¬ 
dressed the Academy. He opened with these words: 

I hope to interest the Academy in announcing that among the pa¬ 
pers of Evariste Galois I have found a solution, as precise as it is pro¬ 
found, of this beautiful problem: whether or not there exists a solution 
by radicals... 


The Life of Galois 

The most accessible account of Galois’s troubled life. Bell (1965), is also one 
of the less reliable, and in particular it seriously distorts the events surrounding his 
death. The best sources I know are Rothman (1982a, 1982b). For Galois’s papers and 
manuscripts, consult Bourgne and Azra (1962) for the French text and facsimiles of 
manuscripts and letters, and Neumann (2011) for English translation and parallel 
French text. Scans of the entire body of work can be found on the web at 
www.bibliotheque-institutdefrance.fr/numerisation/ 

Evariste Galois (Figure 3) was born at Bourg-la-Reine near Paris on 25 Octo¬ 
ber 1811. His father Nicolas-Gabriel Galois was a Republican (Kollros 1949)—that 
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is, he favoured the abolition of the monarchy. He was head of the village liberal 
party, and after the return to the throne of Louis XVIII in 1814, Nicolas became 
town mayor. Evariste’s mother Adelaide-Marie ( nee Demante) was the daughter of 
a jurisconsult—a legal expert who gives opinions about cases brought before them. 
She was a fluent reader of Latin, thanks to a solid education in religion and the clas¬ 
sics. 

For the first twelve years of his life, Galois was educated by his mother, who 
passed on to him a thorough grounding in the classics, and his childhood appears to 
have been a happy one. At the age of ten he was offered a place at the College of 
Reims, but his mother preferred to keep him at home. In October 1823 he entered 
a preparatory school, the College de Louis-le-Grand. There he got his first taste of 
revolutionary politics: during his first term the students rebelled and refused to chant 
in chapel. He also witnessed heavy-handed retribution, for a hundred of the students 
were expelled for their disobedience. 

Galois performed well during his first two years at school, obtaining first prize 
in Latin, but then boredom set in. He was made to repeat the next year’s classes, but 
predictably this just made things worse. During this period, probably as refuge from 
the tedium, Galois began to take a serious interest in mathematics. He came across 
a copy of Adrien-Marie Legendre’s Elements de Geometrie, a classic text which 
broke with the Euclidean tradition of school geometry. According to Bell (1965) 
Galois read it ‘like a novel’, and mastered it in one reading—but Bell is prone to 
exaggeration. Whatever the truth here, the school algebra texts certainly could not 
compete with Legendre’s masterpiece as far as Galois was concerned, and he turned 
instead to the original memoirs of Lagrange and Abel. At the age of fifteen he was 
reading material intended only for professional mathematicians. But his classwork 
remained uninspired, and he seems to have lost all interest in it. His rhetoric teachers 
were particularly unimpressed by his attitude, and accused him of affecting ambition 
and originality, but even his own family considered him rather strange at that time. 

Galois did make life very difficult for himself. For a start, he was was an untidy 
worker, as can be seen from some of his manuscripts (Bourgne and Azra 1962). 
Figures 4 and 5 are a sample. Worse, he tended to work in his head, committing only 
the results of his deliberations to paper. His mathematics teacher Vernier begged him 
to work systematically, no doubt so that ordinary mortals could follow his reasoning, 
but Galois ignored this advice. Without adequate preparation, and a year early, he 
took the competitive examination for entrance to the Ecole Polytechnique. A pass 
would have ensured a successful mathematical career, for the Polytechnique was the 
breeding-ground of French mathematics. Of course, he failed. Two decades later Olry 
Terquem (editor of the journal Nouvelles Annales des Mathematiques ) advanced the 
following explanation: ‘A candidate of superior intelligence is lost with an examiner 
of inferior intelligence. Because they do not understand me, I am a barbarian...’ To 
be fair to the examiner, communication skills are an important ingredient of success, 
as well as natural ability. We might counter Terquem with ‘Because I do not take 
account of their inferior intelligence, I risk being misunderstood.’ But Galois was 
too young and impetuous to see it that way. 

In 1828 Galois enrolled in an advanced mathematics course offered by Louis- 
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FIGURE 3: Portrait of Evariste Galois drawn from memory by his brother Alfred, 
1848. 


Paul-Emile Richard, who recognised his ability and was very sympathetic towards 
him. He was of the opinion that Galois should be admitted to the Polytechnique 
without examination—probably because he recognised the dangerous combination 
of high talent and poor examination technique. If this opinion was ever communi¬ 
cated to the Poly technique, it fell on deaf ears. 

The following year saw the publication of Galois’s first research paper (Galois 
1897) on continued fractions; though competent, it held no hint of genius. Mean¬ 
while, Galois had been making fundamental discoveries in the theory of polynomial 
equations, and he submitted some of his results to the Academy of Sciences. The ref¬ 
eree was Augustin-Louis Cauchy, who had already published work on the behaviour 
of functions under permutation of the variables, a central theme in Galois’s theory. 

As Rothman (1982a) says, ‘We now encounter a major myth.’ Many sources state 
that Cauchy lost the manuscript, or even deliberately threw it away, either to conceal 
its contents or because he considered it worthless. But Rene Taton (1971) found a 
letter written by Cauchy in the archives of the Academy. Dated 18 January 1830, it 
reads in part: 

I was supposed to present today to the Academy first a report on 
the work of the young Galoi [spelling was not consistent in those days] 
and second a memoir on the analytic determination of primitive roots 
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[by Cauchy]... Am indisposed at home. I regret not being able to attend 
today’s session, and I would like you to schedule me for the following 
session for the two indicated subjects. 

So Cauchy still had the manuscript in his possession, six months after Galois had 
submitted it. Moreover, he found the work sufficiently interesting to want to draw 
it to the Academy’s attention. However, at the next session of the Academy, on 25 
January, Cauchy presented only his own paper. What had happened to the paper by 
Galois? 

Taton suggests that Cauchy was actually very impressed by Galois’s researches, 
because he advised Galois to prepare a new (no doubt improved) version, and to sub¬ 
mit it for the Grand Prize in Mathematics—the pinnacle of mathematical honour— 
which had a March 1 deadline. There is no direct evidence for this assertion, but the 
circumstantial evidence is quite convincing. We do know that Galois made such a 
submission in February. The following year the journal Le Globe published an ap¬ 
peal for Galois’s aquittal during his trial for allegedly threatening the king’s life (see 
below): 


Last year before March 1, M. Galois gave to the secretary of the 
Institute a memoir on the solution of numerical equations. This mem¬ 
oir should have been entered in the competition for the Grand Prize in 
Mathematics. It deserved the prize, for it could resolve some difficulties 
that Lagrange had failed to do. Cauchy had conferred the highest praise 
on the author about this subject. And what happened? The memoir is 
lost and the prize is given without the participation of the young savant. 

Rothman points out that Cauchy fled France in September 1830, so the article is un¬ 
likely to have been based on Cauchy’s own statements. Le Globe was a journal of 
the Saint-Simonian organisation, a neo-Christian socialist movement founded by the 
Comte de Sainte-Simone. When Galois left jail, his closest friend Auguste Cheva¬ 
lier invited him to join a Saint-Simonian commune founded by Prosper Enfantin. 
Chevalier was a very active member and an established journalist. It is plausible that 
Chevalier wrote the article, in which case the original source would have been Ga¬ 
lois himself. If so, and if Galois was telling the truth, he knew that Cauchy had been 
impressed by the work. 

The same year held two major disasters. On 2 July 1829 Galois’s father com¬ 
mitted suicide after a bitter political dispute in which the village priest forged Nico¬ 
las’s signature on malicious epigrams aimed at his own relatives. It could not have 
happened at a worse time, for a few days later Galois again sat for entrance to the 
Polytechnique—his final chance. There is a legend (Bell 1965, Dupuy 1896) that 
he lost his temper and threw an eraser into the examiner’s face, but according to 
Bertrand (1899) this tradition is false. Apparently the examiner, Dinet, asked Galois 
some questions about logarithms. 

In one version of the story, Galois made some statements about logarithmic se¬ 
ries, Dinet asked for proofs, and Galois refused on the grounds that the answer was 
completely obvious. A variant asserts that Dinet asked Galois to outline the theory of 
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‘arithmetical logarithms’. Galois informed him, no doubt with characteristic blunt¬ 
ness, that there were no arithmetical logarithms. Dinet failed him. 

Was Galois right, though? It depends on what Dinet had in mind. The phrase 
‘arithmetical logarithms’ is not necessarily meaningless. In 1801 Carl Friedrich 
Gauss had published his epic Disquisitiones Arithmeticae, which laid the founda¬ 
tions of number theory for future generations of mathematicians. Ironically, Gauss 
had sent it to the French Academy in 1800, and it was rejected. In the Disquisitiones 
Gauss developed the notion of a primitive root modulo a prime. If g is a primitive root 
(mod p) then every nonzero element m (mod p ) can be written as a power m = g uiln ). 
Then a(mn) = a{m) +a(n), so a(m) is analogous to logm. Gauss called a(m) the in¬ 
dex of m to base g, and Article 58 of his book begins by stating that ‘Theorems 
pertaining to indices are completely analogous to those that refer to logarithms.’ So 
if this is what Dinet was asking about, any properly prepared candidate should have 
recognised it, and known about it. 

Because he had expected to be admitted to the Polytechnique, Galois had not 
studied for his final examinations. Now faced with the prospect of the Ecole Normale, 
then called the Ecole Preparatoire, which at that time was far less prestigious than the 
Polytechnique, he belatedly prepared for them. His performance in mathematics and 
physics was excellent, in literature less so; he obtained both the Bachelor of Science 
and Bachelor of Letters on 29 December 1829. 

Possibly following Cauchy’s recommendation, in February 1830 Galois pre¬ 
sented a new version of his researches to the Academy of Sciences in competi¬ 
tion for the Grand Prize in Mathematics. The manuscript reached the secretary 
Joseph Fourier, who took it home for perusal. But he died before reading it, and 
the manuscript could not be found among his papers. It may not have been Fourier 
who lost it, however; the Grand Prize committee had three other members: Legendre, 
Sylvestre-Franfois Lacroix, and Louis Poinsot. 

If the article in Le Globe is to be believed, no lesser a light than Cauchy had con¬ 
sidered Galois’s manuscript to have been worthy of the prize. The loss was probably 
an accident, but according to Dupuy (1896), Galois was convinced that the repeated 
losses of his papers were not just bad luck. He saw them as the inevitable effect of 
a society in which genius was condemned to an eternal denial of justice in favour of 
mediocrity, and he blamed the politically oppressive Bourbon regime. He may well 
have had a point, accident or not. 

At that time, France was in political turmoil. King Charles X succeeded Louis 
XVIII in 1824. In 1827 the liberal opposition made electoral gains; in 1830 more 
elections were held, giving the opposition a majority. Charles, faced with abdication, 
attempted a coup d’etat. On 25 July he issued his notorious Ordonnances suppressing 
the freedom of the press. The populace was in no mood to tolerate such repression, 
and revolted. The uprising lasted three days, after which as a compromise the Duke 
of Orleans, Louis-Philippe, was made king. During these three days, while the stu¬ 
dents of the Polytechnique were making history in the streets, Galois and his fellow 
students were locked in by Guigniault, Director of the Ecole Normale. Galois was 
incensed, and subsequently wrote a blistering attack on the Director in the Gazette 
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des Ecoles, signing the letter with his full name. An excerpt (the letter was published 
in December) reveals the general tone: 

Gentlemen: 

The letter which M. Guignault placed in the Lycee yesterday, on 
the account of one of the articles in your journal, seemed to me most 
improper. I had thought that you would welcome eagerly any way of 
exposing this man. 

Here are the facts which can be vouched for by forty-six students. 

On the morning of July 28, when several students of the Ecole Nor- 
male wanted to join in the stmggle, M. Guigniault told them, twice, that 
he had the power to call the police to restore order in the school. The 
police on the 28th of July! 

The same day, M. Guigniault told us with his usual pedantry: ‘There 
are many brave men fighting on both sides. If I were a soldier, I would 
not know what to decide. Which to sacrifice, liberty or LEGITIMACY?’ 

There is the man who the next day covered his hat with an enormous 
tricolor cockade. There are our liberal doctrines! 

The editor removed the signature, the Director was not amused, and Galois was ex¬ 
pelled because of his ‘anonymous’ letter (Dalmas 1956). 

Galois promptly joined the Artillery of the National Guard, a branch of the mili¬ 
tia composed almost entirely of Republicans. On 21 December 1830 the Artillery of 
the National Guard, almost certainly including Galois, was stationed near the Lou¬ 
vre, awaiting the verdict of the trial of four ex-minsters. The public wanted these 
functionaries executed, and the Artillery was planning to rebel if they received only 
life sentences. Just before the verdict was announced, the Louvre was surrounded by 
the full National Guard, plus other troops who were far more trustworthy. When the 
verdict of a jail sentence was heralded by a cannon shot, the revolt failed to materi¬ 
alise. On 31 December, the king abolished the Artillery of the National Guard on the 
grounds that it constituted a serious security threat. 

Galois was now faced with the urgent problem of making a living. On 13 January 
1831 he tried to set up as a private teacher of mathematics, offering a course in 
advanced algebra. Forty students enrolled, but the class soon petered out, probably 
because Galois was too involved in politics. On 17 January he submitted a third 
version of his memoir to the Academy: On the Conditions of Solubility of Equations 
by Radicals. Cauchy was no longer in Paris, so Simeon Poisson and Lacroix were 
appointed referees. After two months Galois had heard no word from them. He wrote 
to the President of the Academy, asking what was happening. He received no reply. 

During the spring of 1831, Galois’s behaviour became more and more extreme, 
verging on the paranoid. On April 18 Sophie Germain, one of the few women math¬ 
ematicians of the time, who studied with Gauss, wrote to Guillaume Libri about 
Galois’s misfortunes: ‘They say he will go completely mad, and I fear this is true.’ 
See Henry (1879). Also in April, 19 members of the Artillery of the National Guard, 
arrested after the events at the Louvre, were put on trial charged with attempting 
to overthrow the government. The jury acquitted them, and on 9 May a celebratory 
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banquet was held. About 200 Republicans were present, all extremely hostile to the 
government of Louis-Philippe. The proceedings became more and more riotous, and 
Galois was seen with a glass in one hand and a dagger in the other. His companions 
allegedly interpreted this as a threat to the king’s life, applauded mightily, and ended 
up dancing and shouting in the street. 

Next day, Galois was arrested. At his subsequent trial, he admitted everything, but 
claimed that the toast proposed was actually ‘To Louis-Philippe, if he turns traitor ,’ 
and that the uproar had drowned the last phrase. But he also made it crystal clear that 
he expected Louis-Philippe to do just that. Nevertheless, the jury acquitted him, and 
he was freed on 15 June. 

On 4 July he heard the fate of his memoir. Poisson declared it ‘incomprehensible’. 
The report (reprinted in full in Taton, 1947) ended as follows: 

We have made every effort to understand Galois’s proof. His rea¬ 
soning is not sufficiently clear, sufficiently developed, for us to judge its 
correctness, and we can give no idea of it in this report. The author an¬ 
nounces that the proposition which is the special object of this memoir is 
part of a general theory susceptible of many applications. Perhaps it will 
transpire that the different parts of a theory are mutually clarifying, are 
easier to grasp together rather than in isolation. We would then suggest 
that the author should publish the whole of his work in order to form a 
definitive opinion. But in the state which the part he has submitted to the 
Academy now is, we cannot propose to give it approval. 

The report may well have been entirely fair. Tignol (1988) points out that Galois’s en¬ 
try ‘did not yield any workable criterion to determine whether an equation is solvable 
by radicals.’ The referees’ report was explicit: 

[The memoir] does not contain, as [its] title promised, the condition 
of solubility of equations by radicals; indeed, assuming as true M. Ga¬ 
lois’s proposition, one could not derive from it any good way of deciding 
whether a given equation of prime degree is soluble or not by radicals, 
since one would first have to verify whether this equation is irreducible 
and next whether any of its roots can be expressed as a rational function 
of two others. 

The final sentence here refers to a beautiful criterion for solubility by radicals of 
equations of prime degree that was the climax of Galois’s memoir. It is indeed un¬ 
clear how it can be applied to any specific equation. Tignol says that ‘Galois’s theory 
did not correspond to what was expected, it was too novel to be readily accepted.’ 
What the referees wanted was some kind of condition on the coefficients that deter¬ 
mined solubility; what Galois gave them was a condition on the roots. Tignol sug¬ 
gests that the referees’ expectation was unreasonable; no simple criterion based on 
the coefficients has ever been found, nor is one remotely likely. But that was unclear 
at the time. See Chapter 25 for further discussion. 

On 14 July, Bastille Day, Galois and his friend Ernest Duchatelet were at the head 
of a Republican demonstration. Galois was wearing the uniform of the disbanded 
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Artillery and carrying a knife, several pistols, and a loaded rifle. It was illegal to 
wear the uniform, and even more so to be armed. Both men were arrested on the 
Pont-Neuf, and Galois was charged with the lesser offence of illegally wearing a 
uniform. They were sent to the jail at Sainte-Pelagie to await trial. While in jail, 
Duchatelet drew a picture on the wall of his cell showing the king’s head, labelled as 
such, lying next to a guillotine. This presumably did not help their cause. Duchatelet 
was tried first; then it was Galois’s turn. On 23 October he was tried and convicted, 
and his appeal was turned down on 3 December. By this time he had spent more 
than four months in jail. Now he was sentenced to six months there. He worked for 
a while on his mathematics (Figure 4 left); then in the cholera epidemic of 1832 he 
was transferred to a hospital. Soon he was put on parole. 



FIGURE 4: Left. First page of preface written by Galois when in jail. Right: Doodles 
left on the table before departing for the fatal duel. ‘Une femme’, with the second 
word scribbled out, can be seen near the lower left corner. 

Along with his freedom he experienced his first and only love-affair, with a 
certain Mile. ‘Stephanie D.’ From this point on the history becomes very compli¬ 
cated and conjectural. Until recently, the lady’s surname was unknown, adding to 
the romantic image of the. femme fatale. The full name appears in one of Galois’s 
manuscripts, but the surname has deliberately been scribbled over, no doubt by Ga¬ 
lois. Some forensic work by Carlos Infantozzi (1968), deciphering the name that Ga¬ 
lois had all but obliterated, led to the suggestion that the lady was Stephanie-Felicie 
Poterin du Motel, the entirely respectable daughter of Jean-Louis Auguste Poterin du 
Motel. Jean-Louis was resident physician at the Sieur Faultrier, where Galois spent 
the last few months of his life. The identification is plausible, but it relies on ex¬ 
tracting a sensible name from beneath Galois’s scribbles, so naturally there is a some 
controversy about it. 

In general, much mystery surrounds this interlude, which has a crucial bearing 





The Life of Galois 


13 


on subsequent events. Apparently Galois was rejected and took it very badly. On 25 
May he wrote to Chevalier: ‘How can I console myself when in one month I have 
exhausted the greatest source of happiness a man can have?’ On the back of one 
of his papers he made fragmentary copies of two letters from Stephanie (Tannery 
1908, Bourgne and Azra 1962). One begins ‘Please let us break up this affair’ and 
continues ’... and do not think about those things which did not exist and which never 
would have existed.’ The other contains the sentences ‘I have followed your advice 
and I have thought over what... has... happened... In any case. Sir, be assured there 
never would have been more. You’re assuming wrongly and your regrets have no 
foundation.’ 

Not long afterwards, Galois was challenged to a duel, ostensibly because of his 
advances towards the young lady. Again, the circumstances are veiled in mystery, 
though Rothman (1982a, 1982b) has lifted a corner of the veil. One school of thought 
(Bell, 1965; Kollros, 1949) asserts that Galois’s infatuation with Mile, du Motel was 
used by his political opponents, who found it the perfect excuse to eliminate their 
enemy on a trumped-up ‘affair of honour’. There are even suggestions that Galois 
was in effect assassinated by a police spy. 

But in his Memo ires, Alexandre Dumas says that Galois was killed by 
Pescheux D’Herbinville, a fellow Republican, see Dumas (1967). Dumas described 
D’Herbinville as ‘a charming young man who made silk-paper cartridges which he 
would tie up with silk ribbons.’ The objects concerned seem to have been an early 
form of cracker, of the kind now familiar at Christmas. He was one of the 19 Republi¬ 
cans acquitted on charges of conspiring to overthrow the government, and something 
of a hero with the peasantry. D’Herbinville was certainly not a spy for the police: 
all such men were named in 1848 when Caussidiere became chief of police. Dalmas 
(1956) cites evidence from the police report, suggesting that the other duellist was 
one of Galois’s revolutionary comrades, and the duel was exactly what it appeared to 
be. This theory is largely borne out by Galois’s own words on the matter (Bourgne 
and Azra, 1962): 

I beg patriots and my friends not to reproach me for dying otherwise 
than for my country. I die the victim of an infamous coquette. It is in a 
miserable brawl that my life is extinguished. Oh! why die for so trivial a 
thing, for something so despicable! ... Pardon for those who have killed 
me, they are of good faith. 

Figure 4 right shows a doodle by Galois with the words ‘Une femme’ partially 
crossed out. It does appear that Stephanie was at least a proximate cause of the duel, 
but very little else is clear. 

On 29 May, the eve of the duel, Galois wrote a famous letter to his friend Au¬ 
guste Chevalier, outlining his mathematical discoveries. This letter was eventually 
published by Chevalier in the Revue Encyclopedique. In it, Galois sketched the con¬ 
nection between groups and polynomial equations, stating that an equation is soluble 
by radicals provided its group is soluble. But he also mentioned many other ideas 
about elliptic functions and the integration of algebraic functions, and other things 
too cryptic to be identifiable. 
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The scrawled comment ‘I have no time’ in the margins (Figure 5) has given rise 
to another myth: that Galois spent the night before the duel frantically writing out 
his mathematical discoveries. However, that phrase has next to it ‘(Author’s note)’, 
which hardly fits such a picture; moreover, the letter was an explanatory accompani¬ 
ment to Galois’s rejected third manuscript, complete with a marginal note added by 
Poisson (Figure 6 left). 
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FIGURE 5: T have no time’ (je n ’ ai pas le temps), above deleted paragraph in lower 
left corner. But consider the context. 


The duel was with pistols. The post-mortem report (Dupuy 1896) states that they 
were fired at 25 paces, but the truth may have been even nastier. Dalmas reprints an 
article from the 4 June 1832 issue of Le Precursor, which reports: 

Paris, 1 June—A deplorable duel yesterday has deprived the exact 
sciences of a young man who gave the highest expectations, but whose 
celebrated precocity was lately overshadowed by his political activities. 

The young Evariste Galois... was fighting with one of his old friends, a 
young man like himself, like himself a member of the Society of Friends 
of the People, and who was known to have figured equally in a political 
trial. It is said that love was the cause of the combat. The pistol was the 
chosen weapon of the adversaries, but because of their old friendship 
they could not bear to look at one another and left the decision to blind 
fate. At point-blank range they were each armed with a pistol and fired. 
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Only one pistol was charged. Galois was pierced through and through 
by a ball from his opponent; he was taken to the hospital Cochin where 
he died in about two hours. His age was 22. L.D., his adversary, is a bit 
younger. 

Who was ‘L.D.’? Does the initial ‘D’ refer to d’Herbinville? Perhaps. ‘D’ is ac¬ 
ceptable because of the variable spelling of the period; the ‘L’ may have been a 
mistake. The article is unreliable on details: it gets the date of the duel wrong, and 
also the day Galois died and his age. So the initial might also be wrong. Rothman has 
another theory, and a more convincing one. The person who best fits the description 
here is not d’Herbinville, but Duchatelet, who was arrested with Galois on the Pont- 
Neuf. Bourgne and Azra (1962) give his Christian name as ‘Ernest’, but that might 
be wrong, or again the ‘L’ may be wrong. To quote Rothman: ‘we arrive at a very 
consistent and believable picture of two old friends falling in love with the same girl 
and deciding the outcome by a gruesome version of Russian roulette.’ 

This theory is also consistent with a final horrific twist to the tale. Galois was hit 
in the stomach, a particularly serious wound that was almost always fatal. If indeed 
the duel was at point-blank range, this is no great surprise. If at 25 paces, he was 
unlucky. 

He did not die two hours later, as Le Precursor says, but a day later on 31 May, 
of peritonitis; he refused the office of a priest. On 2 June 1832 he was buried in the 
common ditch at the cemetery of Montparnasse. 

His letter to Chevalier ended with these words (Figure 6 right): 

Ask Jacobi or Gauss publicly to give their opinion, not as to the 
truth, but as to the importance of these theorems. Later there will be, I 
hope, some people who will find it to their advantage to decipher all this 


mess... 
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FIGURE 6: Left: Marginal comment by Poisson. Right: The final page written by 
Galois before the duel. ‘To decipher all this mess’ (dechiffrer tout ce gdchis, is the 
next to last line). 





Chapter 1 

Classical Algebra 


In the first part of this book. Chapters 1-15, we present a (fairly) modern version of 
Galois’s ideas in the same setting that he used, namely, the complex numbers. Later, 
from Chapter 16 onwards, we generalise the setting, but the complex numbers have 
the advantages of being familiar and concrete. By initially restricting ourselves to 
complex numbers, we can focus on the main ideas that Galois introduced, without 
getting too distracted by ‘abstract nonsense’. 

A warning is in order. The decision to work over the complex numbers has advan¬ 
tages in terms of accessibility of the material, but it sometimes makes the discussion 
seem clumsy by comparison with the elegance of an axiomatic approach. This is ar¬ 
guably a price worth paying, because this way we appreciate the abstract viewpoint 
when it makes its appearance, and we understand where it comes from. However, it 
also requires a certain amount of effort to verify that many of the proofs in the com¬ 
plex case go through unchanged to more general fields—and that some do not, and 
require modification. 

We assume familiarity with the basic theory of real and complex numbers, but 
to set the scene, we recall some of the concepts involved. We begin with a brief 
discussion of complex numbers and introduce two important ideas. Both relate to 
subsets of the complex numbers that are closed under the usual arithmetic operations. 
A subring of the complex numbers is a subset closed under addition, subtraction, and 
mutliplication; a subfield is a subring that is also closed under division by any non¬ 
zero element. Both concepts were formalised by Richard Dedekind in 1871, though 
the ideas go back to Peter Gustav Lejeune-Dirichlet and Kronecker in the 1850s. 

We then show that the historical sequence of extensions of the number system, 
from natural numbers to integers to rationals to reals to complex numbers, can with 
hindsight be interpreted as a quest to make more and more equations have solutions. 
We are thus led to the concept of a polynomial, which is central to Galois theory 
because it determines the type of equation that we wish to solve. And we appreciate 
that the existence of a solution depends on the kind of number that is permitted. 

Throughout, we use the standard notation N,Z,Q,K,C for the natural numbers, 
integers, rationals, real numbers, and complex numbers. These systems sit inside 
each other: 

Nczcqcicc 

and each C symbol hints at a lengthy historical process in which ‘new numbers’ 
were proposed for mathematical reasons—usually against serious resistance on the 
grounds that although their novelty was not in dispute, they were not numbers and 
therefore did not exist. 
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1.1 Complex Numbers 

A complex number has the form 

z = x + iy 

where x.y are real numbers and r = — 1. Therefore i = \/ -1, in some sense. The 
easiest way to define what we mean by '/—l is to consider C as the set R 2 of all 
pairs of real numbers (. x,y ), with algebraic operations 

{x u yi) + (x 2 ,yi) = (xi+x 2 ,yi+y 2 ) (1 j. 

(xi,yi)(x 2 ,y 2 ) = {xix 2 -yiy 2 ,x\y 2 +x 2 yi) 

Then we identify (x.O) with the real number x to arrange that ECC, and define 
i = (0,1). In consequence, (. x,y ) becomes identified with x + iy. The formulas (1.1) 
imply that r = (0,1) (0,1) = (—1,0) which is identified with the real number -1, so 
i is a ‘square root of minus one’. Observe that (0,1) is not of the form (,v,0), so i is 
not real, which is as it should be, since — 1 has no real square root. 

This approach seems to have first been published by the Irish mathematician 
William Rowan Hamilton in 1837, but in that year Gauss wrote to the geometer 
Wolfgang Bolyai that the same idea had occurred to him in 1831. This was probably 
true, because Gauss usually worked things out before anybody else did, but he set 
himself such high standards for publication that many of his more important ideas 
never saw print under his name. Moreover, Gauss was somewhat conservative, and 
shied away from anything potentially controversial. 

Once we see that complex numbers are just pairs of real numbers, the previously 
mysterious status of the ‘imaginary’ number \J — 1 becomes much more prosaic. In 
fact, to the modern eye it is the ‘real’ numbers that are mysterious, because their 
rigorous definition involves analytic ideas such as sequences and convergence, which 
lead into deep philosophical waters and axiomatic set theory. In contrast, the step 
from R to R 2 is essentially trivial—except for the peculiarities of human psychology. 


1.2 Subfields and Subrings of the Complex Numbers 

For the first half of this book, we keep everything as concrete as possible—but 
not more so, as Albert Einstein is supposed to have said about keeping things simple. 
Abstract algebra courses usually introduce (at least) three basic types of algebraic 
structure, defined by systems of axioms: groups, rings, and fields. Linear algebra 
adds a fourth: vector spaces. For the first half of this book, we steer clear of abstract 
rings and fields, but we do assume the basics of finite group theory and linear algebra. 

Recall that a group is a set G equipped with an operation of ‘multiplication’ writ¬ 
ten (g,h) i —y gh. If g,h £ G then gh £ G. The associative law ( gh)k = g(hk) holds for 
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all gfi,k £ G. There is an identity 1 £ G such that 1 g = g = g 1 for all g £ G. Finally, 
every g £ G has an inverse g~ l £ G such that gg 1 = 1 = g 1 g. The classic example 
here is the symmetric group S„, consisting of all permutations of the set {1,2,... ,n} 
under the operation of composition. We assume familiarity with these axioms, and 
with subgroups, isomorphisms, homomorphisms, normal subgroups, and quotient 
groups. 

Rings are sets equipped with operations of addition, subtraction, and multiplica¬ 
tion; fields also have a notion of division. The formal definitions were supplied by 
Heinrich Weber in 1893. The axioms specify the formal properties assumed for these 
operations—for example, the commutative law ab = ba for multiplication. 

In the first part of this book, we do not assume familiarity with abstract rings and 
fields. Instead, we restrict attention to subrings and subfields of C, or polynomials 
and rational functions over such subrings and subfields. Informally, we assume that 
the terms ‘polynomial’ and ‘rational expression’ (or ‘rational function’) are familiar, 
at least over C, although for safety’s sake we define them when the discussion be¬ 
comes more formal, and redefine them when we make the whole theory more abstract 
in the second part of the book. There were no formal concepts of ‘ring’ or ‘field’ in 
Galois’s day and linear algebra was in a rudimentary state. He had to invent groups 
for himself. So we are still permitting ourselves a more extensive conceptual toolkit 
than his. 

Definition 1.1. A subring of C is a subset R C C such that 1 gS, and if x,y £ R then 

x+y, —x, and xy £ R. 

(The condition that 1 £ R is required here because we use ‘ring’ as an abbrevia¬ 
tion for what is often called a ‘ring-with -1 ’ or ‘unital ring’.) 

A subfield of C is a subring K C C with the additional property that if x £ K and 
x 7 ^ 0 thenx * 1 £ K. 

Here x“ 1 = 1 /x is the reciprocal. As usual we often write x/y for xy * 1 . 

It follows immediately that every subring of C contains 1 + (—1) =0, and is 
closed under the algebraic operations of addition, subtraction, and multiplication. A 
subfield of C has all of these properties, and is also closed under division by any 
nonzero element. Because R and K in Definition 1.1 are subsets of C, they inherit the 
usual rules for algebraic manipulation. 

Examples 1.2. (1) The set of all a + bi, for a . h £ Z, is a subring of C, but not a 
subfield. 

Since this is the first example we outline a proof. Let 
R = {a +bi: a,b £ Z} 

Since 1 = 1+ 0 i, we have 1 £R. Let x = a + bi,y = c + di £ R. Then 

x + y = (a + c) + (b + d)i £ R 
—x = — a — bi £ R 
xy = (ac — bd) + (ad + bc)i £ R 
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and the conditions for a subring are valid. However, 2 £ R but its reciprocal 2 _1 = 
\ R, so R is not a subfield. 

(2) The set of all a + bi, for a, b £ Q, is a subfield of C. 

Let 

K = {a + bi: a,b € Q} 

The proof is just like case (1), but now 


(a + bi) 1 


a 

a 2 +b 2 


b 

a 2 +b 2 


i£K 


so K is a subfield. 

(3) The set of all polynomials in K , with integer coefficients, is a subring of C, but 
not a subfield. 

(4) The set of all polynomials in K , with rational coefficients, is a subring of C. We 
can appeal to a result proved in Chapter 24 to show that this set is not a subfield. 
Suppose that = f(n) where / is a polynomial over Q. Then tzf{n) — 1=0, 
so n satisfies a nontrivial polynomial equation with rational coefficients, contrary to 
Theorem 24.5 of Chapter 24. 

(5) The set of all rational expressions in n with rational coefficients (that is, fractions 
p(n)/q(n) where p,q are polynomials over Q and q(n) 0) is a subfield of C. 

(6) The set 2Z of all even integers is not a subring of C, because (by our convention) 
it does not contain 1. 

(7) The set of all a + b\fl, for a , b £ Q, is not a subring of C because it is not closed 
under multiplication. However, it is closed under addition and subtraction. 


Definition 1.3. Suppose that K and L are subfields of C. An isomorphism between 
K and L is a map <j> : K —> L that is one-to-one and onto, and satisfies the condition 


<Kx+y) = <t>{x) + <t>(y) (t>{xy) = <t>(x)<t>(y) ( 1 . 2 ) 


for all x,y £ K. 

Proposition 1.4. If 0 : K —» L is an isomorphism, then: 

0 ( 0 ) = 0 
0 ( 1 ) = 1 
0 ( x) = -0(x) 

0(x“ 1 ) = (0(x)) _1 

Proof. Since Ox = 0 for all x € K , we have 0(0)0 (x) = 0(0) for all x £ K. Let x = 
0“ 1 (0). which exists since 0 is one-to-one and onto. Then 0(0).0 = 0(0), so 0 = 
0 ( 0 ). 

Since lx = x for all x £ K , we have 0 (1) 0 (x) = 0 (x) for all x £ K. Let x = 0 1 (1) 
to deduce that 0(1).1 = 1, so 0(1) = 1. 

Since x + (—x) = 0 for all x £ K, we have 0 (x) + 0 (—x) = 0 (0) = 0. Therefore 
0 ( x) = 0 (x). 

Since x.x -1 = 1 for all x £ K, we have 0(x).0(x _1 ) = 0(1) = 1. Therefore 
0(x- 1 ) = (0(x))- 1 . □ 
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If <j) satisfies (1.2) and is one-to-one but not necessarily onto, it is a monomor¬ 
phism. An isomorphism of K with itself is called an automorphism of K. 

Throughout the book we make extensive use of the following terminology: 

Definition 1.5. A primitive nth root of unity is an nth root of 1 that is not an with root 
of 1 for any proper divisor m of n. 

For example, i is a primitive fourth root of unity, and so is —i. Since (—l) 4 = 1, 
the number — 1 is a fourth root of unity, but it is not a primitive fourth root of unity 
because (— l) 2 = 1. 

Over C the standard choice for a primitive nth root of unity is 

= e 2 ™/" 

We omit the subscript n when this causes no ambiguity. 

The next result is standard, but we include a proof for completeness. 

Proposition 1.6. Let £ = c 2Ttl /". Then = e 2kK, / n is a primitive nth root of unity if 
and only ifk is prime to n. 

Proof. We prove the equivalent statement: C, k = e 2kltl / n is not a primitive nth root of 
unity if and only if k is not prime to n. 

Suppose that £* is not a primitive nth root of unity. Then (■ C, k ) m = 1 where m is 
a proper divisor of n. That is, n = mr where r > 1. Therefore C, km = 1, so mr = n 
divides km. This implies that r\k, and since also r\n we have (n,k) > r > 1, so k is 
not prime to n. 

Conversely, suppose that k is not prime to n, and let r > 1 be a common divisor. 
Then r\k and n = mr where m < n. Now km is divisible by mr = n, so {C, k ) m = 1. 
That is, £* is not a primitive nth root of unity. □ 

Examples 1.7. (1) Complex conjugation x + iy i —> x — iy is an automorphism of C. 
Indeed, if we denote this map by a, then: 

a((x + iy) + (u + iv)) = a((x+u) + i{y + v)) 

= (x + u) — i(y + v) 

= (x — iy) + (n — iv) 

= a(x + iy) + cc(u + iv) 
a((x + /y)(n + /v)) = a((xu — yv) + i{xv+yu)) 

= xu — yv — i (xv + yu) 

= (x — iy)(u — iv ) 

= a(x + iy)cc(u + iv) 

(2) Let K be the set of complex numbers of the form p + q\f2 , where p, q € Q. This 
is a subfield of C because 


(p + qV2)(p — q\/2) = p 2 -2q 2 
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so 


{p + qV 2 ) 1 


p _</ 

p 2 — 2 q 2 p 2 — 2 g 2 


if /; and <7 are non-zero. The map p + q \/2 1 —> p — 1/\/2 is an automorphism of /if. 
(3) Let a = v^2 € R, and let 


co = - 


1 

2 



be a primitive cube root of unity in C. The set of all numbers p + ga + ra 2 , for 
p,q,r £ Q, is a subfield of C, see Exercise 1.5. The map 


p + qa + ra 2 h> p + qcoa + rco 2 a 2 


is a monomorphism onto its image, but not an automorphism. Exercise 1.6. 


1.3 Solving Equations 

A physicist friend of mine once complained that while every physicist knew what 
the big problems of physics were, his mathematical colleagues never seemed to be 
able to tell him what the big problems of mathematics were. It took me a while to 
realise that this doesn’t mean that they didn’t know, and even longer to articulate 
why. The reason, I claim, is that the big problems of physics, at any given moment, 
are very specific challenges: measure the speed of light, prove that the Higgs boson 
exists, find a theory to explain high-temperature superconductors. Mathematics has 
problems like that too; indeed, Galois tackled one of them—prove that the quintic 
cannot be solved by radicals. But the big problems of mathematics are more general, 
and less subject to fashion (or disappearance by virtue of being solved). They are 
things like ‘find out how to solve equations like this one’, ‘find out what shape things 
like this are’, or even ‘find out how many of these gadgets can exist’. Mathematicians 
know this, but it is so deeply ingrained in their way of thinking that they seldom 
consciously recognise such questions as big problems. However, such problems have 
given rise to entire fields of mathematics—here, respectively, algebra, topology, and 
combinatorics. I mention this because it is the first of the above big problems that 
runs like an ancient river through the middle of the territory we are going to explore. 
Find out how to solve equations. Or, as often as not, prove that it cannot be done with 
specified methods. 

What sort of equations? For Galois: polynomials. But let’s work up to those in 
easy stages. 

The usual reason for introducing a new kind of number is that the old ones are 
inadequate for solving some important problem. Most of the historical problems in 
this area can be formulated using equations—though it must be said that this is a 
modern interpretation and the ancient mathematicians did not think in quite those 
terms. 
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For example, the step from N to Z is needed because although some equations, 
such as 

f + 2 = 7 

can be solved for t G N, others, such as 

t + 1 =2 

cannot. However, such equations can be solved in Z, where t = — 5 makes sense. 
(The symbol x is more traditional than t here, but it is convenient to standardise on t 
for the rest of the book, so we may as well start straight away.) 

Similarly, the step from Z to Q (historically, it was initially from N to Q + , the 
positive rationals) makes it possible to solve the equation 

2t = 7 

because t = j makes sense in Q. 

In general, an equation of the form 

at + b = 0 

where a,b are specific numbers and t is an unknown number, or ‘variable’, is called 
a linear equation. In a subfield of C, any linear equation with a ^ 0 can be solved, 
with the unique solution t = — b/a. 

The step from Q to R is related to a different kind of equation: 

t 1 = 2 

As the ancient Greeks understood (though in their own geometric manner—they did 
not possess algebraic notation and thought in a very different way from modern math¬ 
ematicians), the ‘solution’ t = \[2 is an irrational number—it is not in Q. (See Ex¬ 
ercise 1.2 for a proof, which may be different from the one you have seen before. It 
is essentially one of the old Greek proofs, translated into algebra. Paul Erdos used to 
talk of proofs being from ‘The Book’, by which he meant an alleged volume in the 
possession of the Almighty, in which only the very best mathematical proofs could 
be found. This Greek proof that the square root of 2 is irrational must surely be in The 
Book. An entirely different proof of a more general theorem is outlined in Exercise 
1.3.) 

Similarly, the step from R to C centres on the equation 

t 2 = -1 

which has no real solutions since the square of any real number is positive. 
Equations of the form 

at 2 +bt+c = 0 

are called quadratic equations. The classic formula for their solutions (there can be 
0 , 1 , or 2 of these) is of course 

—b ± \/b 2 — 4 ac 

t = 


2 a 
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and this gives all the solutions t provided the formula makes sense. For a start, we 
need a ^ 0. (If a = 0 then the equation is actually linear, so this restriction is not a 
problem.) Over the real numbers, the formula makes sense if b 2 — Aac > 0, but not 
if b 2 — Aac < 0. Over the complex numbers it makes sense for all a.b. c. Over the 
rationals, it makes sense only when b 2 — Aac is a perfect square—the square of a 
rational number. 


1.4 Solution by Radicals 

We begin by reviewing the state of the art regarding solutions of polynomial 
equations, as it was just before the time of Galois. We consider linear, quadratic, cu¬ 
bic, quartic, and quintic equations in turn. In the case of the quintic, we also describe 
some ideas that were discovered after Galois. Throughout, we make the default as¬ 
sumption of the period: the coefficients of the equation are complex numbers. 

Linear Equations 

Let a,b £ C with a / 0. The general linear equation is 

at + b = 0 


and the solution is clearly 


t = 


b 

a 


Quadratic Equations 

Let a,b,c £ C with a / 0. The general quadratic equation is 

at 2 +bt +c = 0 

Dividing by a and renaming the coefficients, we can consider the equivalent equation 

t~ at T b = 0 

The standard way to solve this equation is to rewrite it in the form 



Taking square roots. 
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so that 



which is the usual quadratic formula except for a change of notation. The process 
used here is called completing the square ; as remarked in the Historical Introduction, 
it goes back to the Babylonians 3600 years ago. 


Cubic Equations 

Let a,b,c £ C with a / 0. The general cubic equation can be written in the form 

/' T at~ T bt c = 0 


where again we have divided by the leading coefficient to avoid unnecessary compli¬ 
cations in the formulas. 

The first step is to change the variable to make a = 0. This is achieved by setting 
y = t + so that t =y— |. Such a move is called a Tschirnhaus transformation, after 
the person who first made explicit and systematic use of it. The equation becomes 

y 3 +py + q = 0 (1.3) 


where 


P = 


q = 


— a 2 + 3 b 

2 a 3 — 9ab + 21c 
27 


To find the solution(s) we try (rabbit out of hat) the substitution 


Now 

y 3 = m + v + 3v / mv / v(v / m + v/v) 

so that (1.3) becomes 

(u + v + q) + (y/u. + \/v)(3y/u\/v + p) =0 


We now choose u and v to make both terms vanish: 


which imply 


u + v + q = 0 
3\/uy/\’ + p = 0 


u + v = — 
mv = — 


q 

P^_ 

21 


(1.4) 

(1.5) 


( 1 . 6 ) 

(1.7) 
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Multiply (1.6) by u and subtract (1.7) to get 

p 3 

u(u + v) — uv = —qu + — 
which can be rearranged to give 

2 P* 

u +qu -=0 

27 

which is a quadratic. 

The solution of quadratics now tells us that 



Since u + v = —q we have 



Changing the sign of the square root just permutes u and v, so we can set the sign to 
+. Thus we find that 


y = 



(1.8) 


which (by virtue of publication, not discovery) is usually called Cardano’s formula. 
(This version differs from the formula in the Historical Introduction because Cardano 
worked with x 2 + px = q, so q changes sign.) Finally, remember that the solution t of 
the original equation is equal to y — a/3. 


Peculiarities of Cardano’s Formula 


An old Chinese proverb says ‘Be careful what you wish for: you might get it’. We 
have wished for a formula for the solution, and we’ve got one. It has its peculiarities. 

First: recall that over C every nonzero complex number z has three cube roots. If 
one of them is a , then the other two are coa and of a, where 


1 y/3 

C0 ~~2 +l ^ 


is a primitive cube root of 1. Then 


co 


2 


1 


The expression for y therefore appears to lead to nine solutions, of the form 

a + /3 a + cof} a + co 2 fi 

coa + J3 coa + cofi coa + co 2 fi 
co 2 a + [3 co 2 a + a>p co 2 a + co 2 fi 
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where a,p are specific choices of the cube roots. 

However, not all of these expressions are zeros. Equation (1.5) implies (1.7), but 
(1.7) implies (1.5) only when we make the correct choices of cube roots. If we choose 
a,p so that 3 a ft + p = 0, then the solutions are 

a + P coa + co 2 p co 2 a + coP 

Another peculiarity emerges when we try to solve equations whose solutions we 
already know. For example, 

y 3 + 3y — 36 = 0 

has the solution y = 3. Here p = 3,q = —36, and Cardano’s formula gives 
y = \AsW 325 + \/l8^/325 

which seems a far cry from 3. However, further algebra converts it to 3: see Exercise 
1.4. 

As Cardano observed in his book, it gets worse: if his formula is applied to 

f 3 — 15f —4 = 0 (1.9) 

it leads to 

t=\J 2 + 7^11+^2-\/-121 (1.10) 

in contrast to the obvious solution t = 4. This is very curious even today, and must 
have seemed even more so in the Renaissance period. 

Cardano had already encountered such baffling expressions when trying to solve 
the quadratic t( 10 — t) =40, with the apparently nonsensical solutions 5 + \/ -15 and 
5 — \/—15, but there it was possible to see the puzzling form of the ‘solution’ as ex¬ 
pressing the fact that no solution exists. However, Cardano was bright enough to spot 
that if you ignore the question of what such expressions mean , and just manipulate 
them as if they are ordinary numbers, then they do indeed satisfy the equation. ‘So,’ 
Cardano commented, ‘progresses arithmetic subtlety, the end of which is as refined 
as it is useless.’ 

However, this shed no light on why a cubic could possess a perfectly reasonable 
solution, but the formula (more properly, the equivalent numerical procedure) could 
not find it. Around 1560 Raphael Bombelli observed that (2 ± -1 ) 3 = 2 ± y 7 —121, 

and recovered (see Exercise 1.7) the solution t = 4 of (1.9) from the formula (1.10), 
again assuming that such expressions can be manipulated just like ordinary num¬ 
bers. But Bombelli, too, expressed scepticism that such manoeuvres had any sensi¬ 
ble meaning. In 1629 Albert Girard argued that such expressions are valid as formal 
solutions of the equations, and should be included ‘for the certitude of the general 
rules’. Girard was influential in making negative numbers acceptable, but he was way 
ahead of his time when it came to their square roots. 

In fact, Cardano’s formula is pretty much useless whenever the cubic has three 
real roots. This is called the ‘irreducible case’ of the cubic, and the traditional escape 
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route is to use trigonometric functions. Exercise 1.8. All this rather baffled the Re¬ 
naissance mathematicians, who did not even have effective algebraic notation, and 
were wary of negative numbers, let alone imaginary ones. 

Using Galois theory, it is possible to prove that the cube roots of complex num¬ 
bers that arise in the irreducible case of the cubic equation cannot be avoided. That 
is, there are no formulas in real radicals for the real and imaginary parts. See Van der 
Waerden (1953) volume 1 page 180, and Isaacs (1985). 

Quartic Equations 

An equation of the fourth degree 

/ 3 -)- at 2 bt~ -\- ct d — 0 

is called a quartic equation (an older term is biquadratic ). To solve it, start by making 
the Tschirnhaus transformation y = t + a/4, to get 

y 4 + py 2 + qy + r = 0 (1.11) 


where 


Rewrite this in the form 


p = b — 
q = c — 
r = d — 

.2 , P 


3 a 2 


8 

ab 

3 a 

-b 

_ 

2 

48 

ac 

a 2 b 

——b 


4 

16 


= ~qy~r+- 


Introduce a new term u, and observe that 


.2 , P 


.2 , P 


.2 , P 


y + - + u) — y + - +2 y + - )u+u 


= —qy — r + — + 2 uy 1 + pu + u 2 

We choose u to make the right hand side a perfect square. If it is, it must be the square 
of \f2uy — 2 j lu , and then we require 

2 1 
, P . , 2 4 

-r+ — +pu + u = — 

4 8m 

Provided u / 0, this becomes 

8m 3 + 8 pu 2 + (2/? — 8 r)u — q 2 = 0 


( 1 . 12 ) 
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which is a cubic in u. Solving by Cardano’s method, we can find u. Now 



so 

y~ + 1— + u = ± (^/2uy — \Fhi 

Finally, we can solve the above two quadratics to find y. 

If u = 0 we do not obtain (1.12), but if u = 0 then q = 0, so the quartic (1.11) is 
a quadratic in y 2 , and can be solved using only square roots. 

Equation (1.12) is called the resolvent cubic of (1.11). Explicit formulas for the 
roots can be obtained if required. Since they are complicated, we shall not give them 
here. 

An alternative approach to the resolvent cubic, not requiring a preliminary 
Tschirnhaus transformation, is described in Exercise 1.13. 


Quintic Equations 

So far, we have a series of special tricks, different in each case. We can start to 
solve the general quintic equation 

t 3 -\- at 4 bt 3 T ct~ T dt T e = 0 

in a similar way. A Tschirnhaus transformation y = t + a/5 reduces it to 

y 5 + py 3 +qy 2 + ry + .v = 0 

However, all variations on the tricks that we used for the quadratic, cubic, and quartic 
equations grind to a halt. 

In 1770-1771 Lagrange analysed all of the above special tricks, showing that 
they can all be ‘explained’ using general principles about symmetric functions of 
the roots. When he applied this method to the quintic, however, he found that it 
‘reduced’ the problem to a sextic—an equation of degree 6. Instead of helping, the 
method made the problem worse. A fascinating description of these ideas, together 
with a method for solving quintics whenever they are soluble by radicals, can be 
found in a lecture by George Neville Watson, rescued from his unpublished papers 
and written up by Berndt, Spearman and Williams (2002). The same article contains 
a wealth of other information about the quintic, including a long list of historical and 
recent references. Because the formulas are messy and the story is lengthy, the most 
we can do here is give some flavour of what is involved. 

Lagrange observed that all methods for solving polynomial equations by radicals 
involve constructing rational functions of the roots that take a small number of values 
when the roots (Xj are permuted. Prominent among these is the expression 

8= FI ( a J~ a k) 
l<j<k<n 


(U3) 
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where n is the degree. This takes just two values, ±5: plus for even permutations 
and minus for odd ones. Therefore A = 5 2 (known as the discriminant because it is 
nonzero precisely when the roots are distinct, so it ‘discriminates’ among the roots) 
is a rational function of the coefficients. This gets us started, and it yields a complete 
solution for the quadratic, but for cubics upwards it does not help much unless we 
can find other expressions in the roots with similar properties under permutation. 

Lagrange worked out what these expressions look like for the cubic and the quar- 
tic, and noticed a pattern. For example, if a cubic polynomial has roots cq, oq ■ oq and 
co is a primitive cube root of unity, then the expression 

a = (cq + coa 2 + or cq) 3 

takes exactly two distinct values. In fact, even permutations leave it unchanged, while 
odd permutations transform it to 

v = (cq + <o 2 a 2 + fticq) 3 

It follows that u + v and uv are fixed by all permutations of the roots, and must there¬ 
fore be expressible as rational functions of the coefficients. So m + v = a.uv = b 
where a,b are rational functions of the coefficients. Therefore it and v are the solu¬ 
tions of the quadratic equation t 2 — at + b = 0, so they can be expressed using square 
roots. But now the further use of cube roots expresses cq + OJOft + 0) 2 cq = \fu and 
cq + ft ) 2 «2 + ft)oq = -y/v by radicals. Since we also know that cq + cq + cq is minus 
the coefficient of t 2 , we have three independent linear equations in the roots, which 
are easily solved. 

Something very similar works for the quartic, with expressions like 

(cq -cq + cq-cq) 2 

But when we try the same idea on the quintic, an obstacle appears. Suppose that 
the roots of the quintic are cq , cq , cq , cq , cq . Let £ he a primitive fifth root of unity. 
Following Lagrange’s lead, it is natural to consider 

W = (ai+C«2 + C 2 «3 + C 3 «4 + C 4 «5) 5 

There are 120 permutations of 5 roots, and they transform vv into 24 distinct expres¬ 
sions. Therefore w is a root of a polynomial of degree 24—a big step in the wrong 
direction, since we started with a mere quintic. 

The best that can be done is to use an expression derived by Arthur Cayley in 
1861, based on an idea of Robert Harley in 1859. This expression is 

x = (cq cq + cqcq + cqcq + cqcq + cqcq - cq cq - cqcq - cqcq - cqcq - cqcq) 2 

It turns out that x takes precisely 6 values when the variables are permuted in all 
120 possible ways. Therefore x is a root of a sextic equation. The equation is very 
complicated and has no obvious roots; it is, perhaps, better than an equation of degree 
24, but it is still no improvement on the original quintic. Except when the sextic 
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happens, by accident, to have a root whose square is rational, in which case the 
quintic is soluble by radicals. Indeed, this is a necessary and sufficient condition for 
a quintic to be soluble by radicals, see Berndt, Spearman and Williams (2002). For 
instance, as they explain in detail, the equation 

t 4 5 + 15f + 12 = 0 


has the solution 


t 


5/— 75 +21\/l0 
V 125 


5 / —75 — 21 x/TO 
V 125 + 


5I225+I2VW 

V 125 


5I225-I2VIO 

V 125 


with similar expressions for the other four roots. 

Lagrange’s general method, then, fails for the quintic. This does not prove that 
the general quintic is not soluble by radicals, because for all Lagrange or anyone else 
knew, there might be other methods that do not make the problem worse. But it does 
suggest that there is something very different about the quintic. Suspicion began to 
grow that no method would solve the quintic by radicals. Mathematicians stopped 
looking for such a solution, and started looking for an impossibility proof instead. 


EXERCISES 

1.1 Use (1.1) to prove that multiplication of complex numbers is commutative and 
associative. That is, if u,v,w are complex numbers, then uv = vu and ( uv)w = 
u{vw). 

1.2 Prove that s/2 is irrational, as follows. Assume for a contradiction that there 
exist integers a,b, with b / 0, such that ( a/b) 2 = 2. 

1. Show that we may assume a,b> 0. 

2. Observe that if such an expression exists, then there must be one in which 
b is as small as possible. 

3. Show that 


4. Show that 2b — a > 0, a — b > 0. 

5. Show that a — b < b, a contradiction. 

1.3 Prove that if q £ Q then y 'q is rational if and only if q is a perfect square; that 
is, it can be written in the form q = p a / ■ ■ ■ pf/' where the integers cij , which 
may be positive or negative, are all even. 
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1.4* Prove without using Cardano’s formula that 

^18 + a/ 325 + \/l8 — \/3^ = 3 

1.5 Let a = \/2 £ R. Prove that the set of all numbers p + qa + ra 2 , for p,q,r£ Q, 
is a subfield of C. 

1.6 Let © be a primitive cube root of unity in C. With the notation of Exercise 1.5, 
show that the map 

p + qa + ra 2 h> p + qcoa + ror a 2 
is a monomorphism onto its image, but not an automorphism. 

1.7 Use Bombelli’s observation that (2 ± \/—I) 3 = 2 ± \/—121 to show that (with 
one choice of values of the cube roots) 

^2 + v/^T2T+y / 2-\/^T5T = 4 

1.8 Use the identity cos 30 = 4cos 3 0 — 3cos 0 to solve the cubic equation r 3 + 
pt + q = 0 when 27 q 2 +4p 3 < 0. 

1.9 Find radical expressions for all three roots of f 3 — 15f — 4 = 0. 

1.10 When 27q 2 +4/; 3 < 0 it is possible to try to make sense of Cardano’s formula 
by generalising Bombelli’s observation; that is, to seek a. j3 such that 



Why is this usually pointless? 

1.11* Let P(n) be the number of ways to arrange n zeros and ones in a row, given 
that ones occur in groups of three or more. Show that 

P{n) = 2P(n— 1) — P(n -2)4- P{n - 4) 

and deduce that as n -4 °° the ratio P p^ -4- x, where x > 0 is real and x 4 — 

2v 3 4- x 2 — 1 =0. Factorise this quartic as a product of two quadratics, and 
hence find x. 

1.12* The largest square that fits inside an equilateral triangle can be placed in any of 
three symmetrically related positions. Eugenio Calabi noticed that there is ex¬ 
actly one other shape of triangle in which there are three equal largest squares. 
Figure 7. Prove that in this triangle the ratio x of the longest side the other 
two is a solution of the cubic equation 2x 3 — 2x 2 — 3x + 2 = 0, and find an 
approximate value of x to three decimal places. 
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1.13 Investigate writing the general quartic t 4 + at 3 + bt 2 + ct + d in the form 

{t 2 + pt + q) 2 - (rt + s) 1 

which, being a difference of two squares, factorises into two quadratics 

(t 2 + pt + q + rt + s) ( t 2 +pt + q—rt — s ) 

and can thus be solved in radicals if p,q,r,s can be expressed in terms of the 
original coefficients a^b^c^d. 

Show that doing this leads to a cubic equation. 

1.14 Mark the following true or false. 

(a) -1 has no square root. 

(b) -1 has no real square root. 

(c) -1 has two distinct square roots in C. 

(d) Every subring of C is a subfield of C. 

(e) Every subfield of C is a subring of C. 

(f) The set of all numbers p + q\J 5 for p, q € Q is a subring of C. 

(g) The set of all numbers p + q\J 5 for p, q £ C is a subring of C. 

(h) Cardano’s formula always gives a correct answer. 

(i) Cardano’s formula always gives a sensible answer. 

(j) A quintic equation over Q can never be solved by radicals. 
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Chapter 2 

The Fundamental Theorem of Algebra 


At the time of Galois, the natural setting for most mathematical investigations was the 
complex number system. The real numbers were inadequate for many questions, be¬ 
cause — 1 has no real square root. The arithmetic, algebra, and—decisively—analysis 
of complex numbers were richer, more elegant, and more complete than the corre¬ 
sponding theories for real numbers. 

In this chapter we establish one of the key properties of C, known as the Fun¬ 
damental Theorem of Algebra. This theorem asserts that every polynomial equation 
with coefficients in C has a solution in C. This theorem is, of course, false over R. — 
consider the equation t 2 +1 = 0. It was fundamental to classical algebra, but the name 
is somewhat archaic, and modern algebra bypasses C altogether, preferring greater 
generality. Because we find it convenient to work in the same setting as Galois, the 
theorem will be fundamental for us. 

All rigorous proofs of the Fundamental Theorem of Algebra require quite a lot 
of background. Here, we give a proof that uses a few simple ideas from algebra and 
trigonometry, estimates of the kind that are familiar from any first course in analysis, 
and one simple basic result from point-set topology. 

Later, we give an almost purely algebraic proof, but the price is the need for much 
more machinery: see Chapter 23. Ironically, that proof uses Galois theory to prove 
the Fundamental Theorem of Algebra, the exact opposite of what Galois did. The 
logic is not circular, because the proof in Chapter 23 rests on the abstract approach 
to Galois theory described in the second part of this book, which makes no use of the 
Fundamental Theorem of Algebra. 


2.1 Polynomials 

Linear, quadratic, cubic, quartic, and quintic equations are examples of a more gen¬ 
eral class: polynomial equations. These take the form 

p(t) = o 

where p(t) is a polynomial in t. 

Mathematics is littered with polynomial equations, arising in a huge variety of 
contexts. As a sample, here are two from the literature. You don’t need to think about 
them: just observe them like a butterfly-collector looking at a strange new specimen. 


35 
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John Horton Conway came up with one of the strangest instances of a polynomial 
equation that I have ever encountered, in connection with the so-called look and say 
sequence. The sequence starts 

1 11 21 1211 111221 312211 13112221 ... 

The rule of formation is most readily seen in verbal form. We start with ‘1’, which 
can be read as ‘one one’, so the next term is 11. This reads ‘two ones’, leading to 
21. Read this as ‘one two, one one’ and you see where 1211 comes from, and so 
on. If L(n) is the length of the nth term in this sequence, approximately how big is 
L(n)? Conway (1985) proves that L(n ') satisfies a 72-term linear recurrence relation. 
Standard techniques from combinatorics then prove that for large n, the value of 
L(n) is asymptotically proportional to a", where a = 1-303577... is the smallest 
real solution of the 71st degree polynomial equation 


f 71 -/ 69 

-2 1 

68 _ .67 

+ 2 f 66 

+ 21 65 

— t 62> — f62 — t 6 ' 

oo 

in 

<N 

+ 

o 

VO 

+5 f 57 + 

3 f 56 

-2 f 55 - 

- 10f 54 

-3 f 53 

- 2f 52 + 6f 51 + 6 f 50 + f 49 + 9t 

-3f 47 - 

7f 46 

— 8f 45 - 

- 8f 44 + 10f 43 

+ 6f 42 + 8f 41 - 

5f 40 - 12f 39 

+7f 38 - 

7f 37 

+ 7 f 36 +f 35 - 

3f 34 + 

10f 33 +f 32 -6f 

31 -2 f 30 

-10 f 29 - 

- 3f 28 + 2f 27 

+ 9t 26 

— 3 f 25 

+ 14f 24 - 8f 23 - 

-It 2 ' +9f 20 

+3f 19 — 

4f 18 

-10f 17 

-It' 6 

+ 12f 15 +7f 14 + 2f 13 - 

- 12t' 2 — 4t" 


~2t 10 + 5f 9 + t 1 - It 6 + It 5 - 4 1 4 + 12r 3 - 6t 2 + 3t - 6 = 0 

The second example is from cosmology. Braden, Brown, Whiting, and York 
(1990) show that the entropy of a black hole is nr^a 2 , where a is a solution of 
the 7th degree equation 


t 5 {t-q 2 )(t-l) + b 2 (t 2 -q 2 ) 2 =0 (2.2) 

where b,q are expressions involving temperature and various fundamental physical 
constants such as the speed of light and Planck’s constant. 

With the importance of polynomial equations now established, we start to de¬ 
velop a coherent theory of their solutions. As the above examples illustrate, a poly¬ 
nomial is an algebraic expression involving the powers of a ‘variable’ or ‘indeter¬ 
minate’ t. We are used to thinking of such a polynomial as the function that maps t 
to the value of the expression concerned, so that the first polynomial represents the 
function / such that /(f) — t 2 — 2t + 6. This ‘function’ viewpoint is familiar, and it 
causes no problems when we are thinking about polynomials with complex numbers 
as their coefficients. Later (Chapter 16) we will see that when more general fields are 
permitted, it is not such a good idea to think of a polynomial as a function. So it is 
worth setting up the concept of a polynomial so that it extends easily to the general 
context. 

We therefore define a polynomial over C in the indeterminate t to be an expres¬ 
sion 

r o + r \t + • • • + r n t n 

where ro,..., r n e C, 0 < n £ Z, and t is undefined. What, though, is an ‘expression’. 
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logically speaking? For set-theoretic purity we can replace such an expression by 
the sequence (ro,.... r„). In fact, it is more convenient to use an infinite sequence 
(ro, r\ ,...) in which all entries rj = 0 when j > n for some finite n: see Exercise 2.2. 
In such a formalism, t is just a symbol for the sequence {0, 1,0 ...}. 

The elements ro, ..., r„ are the coefficients of the polynomial. In the usual way, 
terms 0 t m may be omitted or written as 0, and 1 t m can be replaced by t"'. 

In practice we often write polynomials in descending order 

fnf n J r r n ~\t n 1 H-+ r\t + ro 

and from now on we make such changes without further comment. 

Two polynomials are defined to be equal if and only if the corresponding co¬ 
efficients are equal, with the understanding that powers of t not occurring in the 
polynomial may be taken to have zero coefficient. To define the sum and the product 
of two polynomials, write 

Yj nt ' 

instead of 

ro + rifH-I -r n t n 

where the summation is considered as being over all integers i > 0, and r^ is defined 
to be 0 if k > n. Then, if 

r = X/,T s = ^s it ‘ 

we define 

r + s = £(r,- + Si)t' (2.3) 

and 

rs = Y,‘U tJ where <lj = L ri>s ‘ (2-4) 

h+i=j 

It is now easy to check directly from these definitions that the set of all polynomials 
over C in the t obeys all of the usual algebraic laws (Exercise 2.3). We denote this 
set by C[f], and call it the ring of polynomials over C in the indeterminate t. 

We can also define polynomials in several indeterminates f| .fi, • • • ,t n , obtaining 
the ring of /(-variable polynomials 


C[t\,t2, ■ ■ ■ dn\ 


in an analogous way. 

An element of C[f] will usually be denoted by a single letter, such as /, when¬ 
ever it is clear which indeterminate is involved. If there is ambiguity, we write f{t) 
to emphasise the role played by t. Although this looks like function notation, tech¬ 
nically it is not. However, polynomials over C can be interpreted as functions, see 
Proposition 2.3 below. 

Next, we introduce a simple but very useful concept, which quantifies how com¬ 
plicated a polynomial is. 

Definition 2.1. If / is a polynomial over C and / ^ 0, then the degree of / is the 
highest power of t occurring in / with non-zero coefficient. 
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For example, t 2 + 1 has degree 2, and 723f 1101 — 911 If 55 + 43 has degree 1101. 
The polynomial (2.1) has degree 71, and (2.2) has degree 7. 

More generally, if / = ]_r/t' and r„ / 0 and r m = 0 for m > n, then / has degree n. 
We write df for the degree of/. To deal with the case / = 0 we adopt the convention 

that d0 = -OO. This symbol is endowed with the following properties: — °° < n for any 

integer n, — °° + n = ——°° x n = —°°, (—°°) 2 = —°°. We do not set (—°°) 2 = +°° 
because 0.0 = 0. 

The following result is immediate from this definition: 

Proposition 2.2. Iff. g are polynomials over C, then 

d{f + g)<mzx(df,dg) d(fg) = df+dg 


□ 

The inequality in the first line is due to the possibility of the highest terms ‘can¬ 
celling’, see Exercise 2.4. 

The /(f) notation makes / appear to be a function, with f as its ‘independent vari¬ 
able’, and in fact we can identify each polynomial / over C with the corresponding 
function. Specifically, each polynomial / £ C[f] can be considered as a function from 
C to C, defined as follows: iff = £r,t' and a £ C, then a is mapped to The 

next proposition proves that when the coefficients lie in C, it causes no confusion if 
we use the same symbols / to denote a polynomial and the function associated with 
it. 

Proposition 2.3. Two polynomials f. g over C define the same function if and only 
if they are equal as polynomials; that is, they have the same coefficients. 

Proof Equivalently, by taking the difference of the two polynomials, we must prove 
that if /(f) is a polynomial over C and /(f) = 0 for all f, then the coefficients of / 
are all 0. Let P{n) be the statement: If a polynomial /(f) over C has degree n, and 
/(f) = 0 for all t £ C, then / = 0. We prove P{n ) for all n by induction on n. 

Both P(0) and P( 1) are obvious. Suppose that P(n — 1) is true. Write 

/(f) = a n t n -|-+£fo 

In particular, /(0) = 0, so ao = 0 and 

/(f) = a n t n + • • • + cqf 

= f(u+ ^, + * * * +^q) 

= tg{t) 

where g(t) = a„t"~ l H - \-a\ has degree n — 1. Now g(t) vanishes for all f £ C 

except, perhaps, f = 0. However, if g(0) =a\ f 0 then g(t) f 0 for f sufficiently small. 
(This follows by continuity of polynomial functions, but it can be proved directly by 
estimating the size of g(e) when e is small.) Therefore g(t) vanishes for all f £ C. By 
induction, g = 0. Therefore / = 0, so P(n) is true and the induction is complete. □ 
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Proposition 2.3 implies that we can safely consider a polynomial over a subfield 
of C as either a formal algebraic expression or a function. It is easy to see that sums 
and products of polynomials agree with the corresponding sums and products of 
functions. Moreover, the same notational flexibility allows us to ‘change the variable’ 
in a polynomial. For example, if t, u are two indeterminates and f(t) = f_rjt', then 
we may define f{u) = Y. r i ll ‘■ It is also clear what is meant by such expressions as 
/(f-3)or/(t 2 + l). 


2.2 Fundamental Theorem of Algebra 

In Section 1.3 we saw that the development of the complex numbers can be 
viewed as the culmination of a series of successive extensions of the natural num¬ 
ber system. At each step, equations that cannot be solved within the existing number 
system become soluble in the new, extended system. For example, C arises from R. 
by insisting that t 2 = — 1 should have a solution. 

The question then arises: why stop at C? Why not find an equation that has no 
solutions over C, and enlarge the number system still further to provide a solution? 

The answer is that no such equation exists, at least if we limit ourselves to poly¬ 
nomials. Every polynomial equation over C has a solution in C. This proposition was 
a matter of heated debate around 1700. In a paper of 1702, Leibniz disputes that it 
can be true, citing the example 

x 4 + a 4 = ^ x+a\j yf—\ 

and presumably thinking that \Jy/— \ is not a complex number. 

However, in 1676 Isaac Newton had already observed the factorisation into real 
quadratics: 

x ■ a == (x a ) —2 a x — (x T a T \/2ux) (x T a~ — \/2ux) 

and Nicholas Bernoulli published the same formula in 1719. In effect, the resolution 
of the dispute rests on observing that \fi = which is in C. In fact, every complex 
number has a complex square root: 

/- z-. a+\/a 2 + b 2 . -a+ Va 2 +b 2 

Va + bi=]f - ----- (2.5) 

(together with minus the same formula), as can be checked by squaring the right- 
hand side. Here the square root of a 2 + b 2 is the positive one, and the signs of the 
other two square roots are chosen to make their product equal to b. Observe that 



a + V a 2 + b 2 > 0 


— a + \/ a 2 +b 2 > 0 
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because a 2 +b 2 > a 2 , so both of the main square roots on the right-hand side are real. 

In 1742 Euler asseited, without proof, that every real polynomial can be decom¬ 
posed into linear or quadratic factors with real coefficients; Bernoulli now erred the 
other way, citing 

x 4 — 4x 3 + 2x 2 + 4x + 4 

with zeros 1 + + 1 — \p2 i+~\7—I, 1 + \/2 — \f^%, and 1 — 'J'l — \f—%. 

Euler responded, in a letter to his friend Christian Goldbach, that the four factors 
occur as two complex conjugate pairs, and that the product of such a pair of factors 
is a real quadratic. He showed this to be the case for Bernoulli’s proposed counterex¬ 
ample. Goldbach suggested that x 4 + 12x — 20 did not agree with Euler’s assertion, 
and Euler pointed out a computational error, adding that he had proved the theorem 
for polynomials of degree < 6. Euler and Jean Le Rond d’Alembert gave incomplete 
proofs for any degree; Lagrange claimed to have filled in the gaps in Euler’s proof 
in 1772, but made the mistake of assuming that the roots existed, and using the laws 
of algebra to deduce that they must be complex numbers, without proving that the 
roots—whatever they were—must obey the laws of algebra. The first genuine proof 
was given by Gauss in his doctoral thesis of 1799. It involved the manipulation of 
complicated trigonometric series to derive a contradiction, and was far from trans¬ 
parent. The underlying idea can be reformulated in topological terms, involving the 
winding number of a curve about a point, see Hardy (1960) and Stewart (1977). Later 
Gauss gave three other proofs, all based on different ideas. 

Other classical proofs use deep results in complex analysis, such as Liouville’s 
Theorem: a bounded function analytic on the whole of the complex plane is constant. 
This depends on Cauchy’s Integral Formula and takes most of a course in complex 
analysis to prove. See Titchmarsh (1960). An alternative approach uses Rouche’s 
Theorem, Titchmarsh (1960) 3.44. Another proof uses the Maximum Modulus The¬ 
orem: if an analytic function is not constant, then the maximum value of its modulus 
on an arbitrary set occurs on the boundary of that set. A variant uses the Minimum 
Modulus Theorem (the minimum value of its modulus on an arbitrary set is either 
zero or occurs on the boundary of that set). See Stewart and Tall (1983) Theorems 
10.14, 10.15. Euler’s approach, which sets the real and imaginary parts of p(z) to 
zero and proves that the resulting curves in the plane must intersect, can be made 
rigorous. William Kingdon Clifford gave a proof based on induction on the power of 
2 that divides the degree n, which is most easily explained using Galois theory. We 
present this in Chapter 23, Corollary 23.13. 

All of these proofs are quite sophisticated. But there’s an easier way, using a few 
ideas from elementary point-set topology and estimates of the kind we encounter 
early on in any course on real analysis. It can be found on Wikipedia, and it deserves 
to be more widely known because it is simple and cuts straight to the heart of the 
issue. The necessary facts can be proved directly by elementary means, and would 
have been considered obvious before mathematicians started worrying about rigour 
in analysis around 1850. So Euler, Gauss, and other mathematicians of those periods 
could have discovered this proof. 

We now state this property of the complex numbers formally, and explore some of 
its easier consequences. It is the aforementioned Fundamental Theorem of Algebra. 
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As we have observed, this is a good name if we are thinking of classical algebra, but 
not such a good name in the context of modern abstract algebra, which constructs 
suitable fields as it goes along and avoids explicit use of complex numbers. 

Theorem 2.4 (Fundamental Theorem of Algebra). If p(z) is a non-constant poly¬ 
nomial over C, then there exists zo £ C such that p(zo) = 0. 

Such a number z is called a root of the equation pit) = 0, or a zero of the poly¬ 
nomial p. For example, i is a root of the equation t 2 + 1 =0 and a zero of t 2 + 1. 
Polynomial equations may have more than one root; indeed, t 2 + 1 = 0 has at least 
one other root, —i. 

The idea behind the proof is illustrated in Figure 8, and can be summarised in 
a few lines. Assume for a contradiction that p(z) is never zero. Then |/? (z) | 2 has a 
nonzero minimum value and attains that minimum at some point w £ C. Consider 
points v on a small circle centred at w, and use simple estimates to show that |p(v)| 2 
must be less than \p{w)\ 2 for some v. Contradiction. 

Now for the details. 

Proof of Theorem 2.4. Suppose for a contradiction that no such zo exists. For some 
R> 0 the set 

®={z: |p(z)| 2 </?} 

is non-empty. The map t// : C — > R + defined by i jf(z) = \p{z)\ 2 is continuous, so 
S> = i// 1 ([()./(]) is compact. For a subset of C this is equivalent to being closed and 
bounded. It follows that |p(z )| 2 attains its minimum value on S>. By the definition of 
this is also its minimum value on C. 

Assume this minimum is attained at w £ C. Then 

\p(z )\ 2 > \p(w )| 2 

for all z £ C, and by assumption p(w) 0. 
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We now consider \p(z)\ 2 as z runs round a small circle centred at w, and derive a 
contradiction. 

Let h £ C. Expand p(w + h) in powers of h to get 

p(w + h) = p Q +pih + p 2 h 2 H-h p„h n (2.6) 

where n is the degree of p. Here the pj are specific complex numbers. They are in 
fact the Taylor series coefficients 

Pj = p ij \w)/jl 

but we don’t actually need to use this, and (2.6) can be proved algebraically without 
difficulty. 

Clearly po = p(w), and we are assuming this is nonzero, so po f 0. If p\ = p 2 : 
••• = p n = 0 then p(z) = po is constant, contrary to hypothesis. So some pj / 0. Let 
m be the smallest integer > 1 from which p m / 0. In (2.6) let h = ee' e for small 
e > 0. Then 

p(w + ee 10 ) =p 0 + p m £ m e mie + 0(e m+1 ) 
where ()(£") indicates terms of order n or more in e. Therefore 

|p(w + ee ,e )| 2 = \po + p m £ m e mie \ 2 + 0(e m+1 ) 

= P0P0 + PoPn,e m e mi9 + P0P m £ m e-' nie + 0(e m+1 ) 

Let pop,,, = for r > 0. Since po ^ 0 and p m / Owe have r > 0. Setting /z = 0 we 
see that poPo = \pt w )\ 2 - Now 

\p(w + ee'' e )| 2 = poPo + r^e m s mie + r^ £ m e~ mie + <9(e m+1 ) 

= \p(w)\ 2 + 2e m rcos(mQ + <!>) + O (e m+1 ) 

Set 6 = ^(0 — n), so that <j> = % — m0. Then cos(zzz0 + 0) = cos(7r) = — 1, and 

| p(w + ee ' 9 )| 2 = \p(w)\ 2 - 2e m r + 0(e m+1 ) 

But e,r > 0, so for sufficiently small e we have 

\ P (w + £C i6 )\ 2 <\ P (W )\ 2 

contradicting the definition of vv. Therefore there exists zq S C such that p(z o) = 
0. □ 


2.3 Implications 

The Fundamental Theorem of Algebra has some useful implications. Before 
proving the most basic of these, we first prove the Remainder Theorem. 
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Theorem 2.5 (Remainder Theorem). Let p(t) £ C[f] with dp> 1, and let a £ C. 

(1) There exist q(t) £ C[f] and r £ C such that p(t ) = (t — Ct)q(t) + r. 

(2) 77;e constant r satisfies r = p(cc). 

Proof. Let y = t — a so that t = y + a. Write p(t) = p n t n H- \- po where p n 0 

and n > 1. Then 

p(t) = Pn (y + a) n -{ - hpo 

Expand the powers of y + a by the binomial theorem, and collect terms to get 

p(t) = a n y"-\ -l-fliy + ao aj£ C 

= y{a n y n 1 H-|-ai) +«o 

= (t - a)q{t)+r 

where 

q(t) = a n (t — a) n ~ l H- ha 2 {t — a) +aiO 

r = a 0 

Now substitute t = a in the identity p(t) = (t — a)q(t) + r to get 
p(a) = (a — a)q(a) + r = 0 ,q(a) + r = r 


□ 

Corollary 2.6. The complex number a is a zero of p{t) if and only if t — cc divides 
p(t ) in C[/]. 

Proposition 2.7. Let p(t ) £ C[t) with dp = n > 1. Then there exist a,\,...,a n £C, 
and 0 k £ C, such that 


p(t)=k(t-a 1 )...(t-a n ) (2.7) 

Proof Use induction on n. The case n = 1 is obvious. If n > 1 we know, by the 
Fundamental Theorem of Algebra, that p(t) has at least one zero in C: call this zero 
a n . By the Remainder Theorem, there exists q(t) £ C[t] such that 

p(t) = (t - a n )q(t) ( 2 . 8 ) 

(note that the remainder r = p(a„) =0). Then dq = n — 1, so by induction 


q(t)=k(t-a l )...(t-a n - l ) 


(2.9) 


For suitable complex numbers k,a\.. .., a„_ i. Substitute (2.9) in (2.8) and the induc¬ 
tion step is complete. □ 
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It follows immediately that the a,j are the only complex zeros of p{1). 

The zeros a ; need not be distinct. Collecting together those that are equal, we 
can rewrite (2.7) in the form 

where k = a„ , the j Bj are distinct, the nij are integers > 1, and m\ H -b»q = n. We 

call nij the multiplicity of the zero fij of p(t). 

In particular, we have proved that every complex polynomial of degree n has 
precisely n complex zeros, counted according to multiplicity. 


EXERCISES 

2.1 Let p(t) £ Q[f]. Show that p(t) has a unique expression in the form 

p(t) = (t. - af).. .{t - a r )q(t) 

(except for re-ordering the af) where a,- £ Q for 1 <j< r and q{t) has no 
zeros in Q[f], Prove that here, the aj are precisely the zeros of p(t) in Q. 

2.2 A formal definition of C[f] runs as follows. Consider the set S of all infinite 
sequences 

(fn )«6N = ,... • • •) 

where a„ £ C for all n £ N, and such that a„ = 0 for all but a finite set of n. 
Define operations of addition and multiplication on S by the rules 

(a n ) T ( b n ) — {ujf) where u n — an bn 

{a n )(s n ) = (v„) where v„ = a n bo +a n -\b\ 4- \-aob„ 

Prove that C[f], so defined, satisfies all of the usual laws of algebra for addition, 
subtraction, and multiplication. Define the map 

6 :C -T S 

e(k) = (k, o,o,o,...) 

and prove that 0(C) C S is isomorphic to C. 

Finally, prove that if we identify a £ C with 0 (a) £ S and the ‘indeterminate’ 

t with (0, 1,0, 0, 0,...) £ S , then (a„) = ao H-b ciNt N , where N is chosen 

so that a n = 0 for n > N. Thus we can define polynomials as sequences of 
complex numbers corresponding to the coefficients. 

2.3 Using (2.3, 2.4), prove that polynomials over C obey the following algebraic 
laws: 

/ + 8 = g + f,f+ (g + h) = (/ + g) + h, fg = gf, f(gh ) = (fg)h, and f{g + 
h)=fg+fh. 
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2.4 Show that d(f + g) can be less than max(<9/, dg), and indeed that d(f + g) 
can be less than min(<9/, dg). 

2.5* If zi,Z2, ■ ■ ■ ,Zn are distinct complex numbers, show that the determinant 


D = 


1 

zi 


1 ••• 1 

Z2 * * ' Zn 


n-1 n— 1 ,n-l 

z 1 Z 2 z„ 


is non-zero. 

(Hint: Consider the Zj as independent indeterminates over C. Then D is a poly¬ 
nomial in the Zj, of total degree 0+1+2-I-(- (n — 1) = \n(n — 1). More¬ 

over, D vanishes whenever Zj = Zk, for k / j, since it then has two identical 
rows. Therefore D is divisible by Zj — Zk for all j / k, hence it is divisible by 
Y\j<k{z.j — Zk)- Now compare degrees.) 

The determinant D is called a Vandermonde determinant , for obscure reasons 
(no such expression occurs in Alexandre-Theophile Vandermonde’s published 
writings). 

2.6 Use the Vandermonde determinant to prove that if a polynomial /(f) vanishes 
for all t £ C, then all coefficients of / are zero. (Hint. Substitute t = 1,2,3,... 
and solve the resulting system of linear equations for the coefficients.) 

2.7 Prove, without using the Fundamental Theorem of Algebra, that every cubic 
polynomial over R can be expressed as a product of linear factors over C. 

2.8* Do the same for cubic polynomials over C. 

2.9 Mark the following true or false. Here /.g are polynomials over C. 

(a) d(f-g) > min(df,dg). 

(b) d(f-g) <min(df,dg). 

(c) d(f-g) <max(df,dg). 

(d) d(f-g) >max(df,dg). 

(e) Every polynomial over C has at least one zero in C. 

(f) Every polynomial over C of degree > 1 has at least one zero in R. 
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Chapter 3 

Factorisation of Polynomials 


Not only is there an algebra of polynomials: there is an arithmetic. That is, there 
are notions analogous to the integer-based concepts of divisibility, primes, prime 
factorisation, and highest common factors. These notions are essential for any serious 
understanding of polynomial equations, and we develop them in this chapter. 

Mathematicians noticed early on that if / is a product gh of polynomials of 
smaller degree, then the solutions of f(t) = 0 are precisely those of g(t ) = 0 together 
with those of h(t) = 0. For example, to solve the equation 

f 3 - 6f 2 + 1 If — 6 = 0 

we can spot the factorisation (f — l)(f — 2)(f — 3) and deduce that the roots are t = 
1,2,3. From this simple idea emerged the arithmetic of polynomials—a systematic 
study of divisibility properties of polynomials with particular reference to analogies 
with the integers. In particular, there is an analogue for polynomials of the Euclidean 
Algorithm for finding the highest common factor of two integers. 

In this chapter we define the relevant notions of divisibility and show that there 
are certain polynomials, the ‘irreducible’ ones, that play a similar role to prime num¬ 
bers in the ring of integers. Every polynomial over a given subfield of C can be 
expressed as a product of irreducible polynomials over the same subfield, in an es¬ 
sentially unique way. We relate zeros of polynomials to the factorisation theory. 

Throughout this chapter all polynomials are assumed to lie in K[t], where K is 
a subfield of the complex numbers, or in R[t], where R is a subring of the complex 
numbers. Some theorems are valid over R , while others are valid only over K: we 
will need both types. 


3.1 The Euclidean Algorithm 

In number theory, one of the key concepts is divisibility: an integer a is divisible 
by an integer b if there exists an integer c such that a = be. For instance, 60 is divisible 
by 3 since 60 = 3.20, but 60 is not divisible by 7. Divisibility properties of integers 
lead to such ideas as primes and factorisation. We wish to develop similar ideas for 
polynomials. 

Many important results in the factorisation theory of polynomials derive from the 
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observation that one polynomial may always be divided by another provided that a 
‘remainder’ term is allowed. This is a generalisation of the Remainder Theorem, in 
which / is assumed to be linear. 

Proposition 3.1 (Division Algorithm). Let f and g be polynomials over K, and 
suppose that f is non-zero. Then there exist unique polynomials q and r over K, such 
that g = fq + r and r has strictly smaller degree than f. 

Proof. Use induction on the degree of g. If dg = — °° then g = 0 and we may take 
q = r = 0. If dg = 0 then g = k is an element of K. If also df = 0 then / is an element 
of K, and we may take q = k/f and r = 0. Otherwise df > 0 and we may take q = 0 
and r = g. This starts the induction. 

Now assume that the result whenever the degree of g is less than n, and let dg = 
n > 0. If df > dg, then we may as before take q = 0. r = g. Otherwise 

/ = cimi" 1 H- \-oo 8 = b n t n H-b 

where a m ^f Of b„ and m < n. Let 

gi = b n a~ 1 t n ~ m f-g 

Since the terms of highest degree cancel (which is the object of the exercise) we 
have dg i < dg. By induction there are polynomials q\ and r\ over K such that g i = 
fqi + r\ and dr\ < df. Let 

q = b n a~ 1 t n ~ m -qi r = -r\ 

Then 

fq + r = b n a m 1 t n ~ m f — q\f — r\ =g + g i -gi=g 

so g = .fq + r, clearly dr < df as required. 

Finally we prove uniqueness. Suppose that 

g = fqi +r\ = fqi + r 2 where dr u dr 2 <df 

Then f(q i — q 2 ) = r 2 — r\. By Proposition 2.2, the polynomial on the left has higher 
degree than that on the right, unless both are zero. Since f f 0 we must have q\ = q 2 
and r i = r 2 . Thus q and r are unique. □ 

With the above notation, q is called the quotient and r is called the remainder 
on dividing g by /. The inductive process we employed to find q and r is called the 
Division Algorithm. 

Example 3.2. Divide g(t) = t 4 — 7f 3 + 5 1 2 + 4 by / = t 2 + 3 and find the quotient 
and remainder. 

Observe that 

t 2 (t 2 + 3) =t 4 + 3t 2 

has the same leading coefficient as g. Then 

g-t 2 (t 2 + 3) = -7r 3 +2t 2 +4 
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which has the same leading coefficient as 

-lt(t 2 +3) = —7f 3 —21 1 

Therefore 

g-t 2 {t 2 + 3)+lt(t 2 + 3) =2f 2 + 21f+4 
which has the same leading coefficient as 

2(f 2 + 3) = 2f 2 +6 


Therefore 

g — t 2 (t 2 +3) + lt(t 2 + 3) — 2(t 2 + 3) = 2 If — 2 


So 


and the quotient q(t) 


g = (t 2 + 3)(f 2 — 7f +2) + (21f — 2) 
f 2 — It + 2, while the remainder r(f) = 21f — 2. 


The next step is to introduce notions of divisibility for polynomials, and in par¬ 
ticular the idea of ‘highest common factor’ which is crucial to the arithmetic of poly¬ 
nomials. 


Definition 3.3. Let / and g be polynomials over K. We say that f divides g (or / is 
a factor of g, or g is a multiple of /) if there exists some polynomial h over K such 
that g = fh. The notation f\g will mean that / divides g, while f\g will mean that / 
does not divide g. 

Definition 3.4. A polynomial d over K is a highest common factor (hcf) of poly¬ 
nomials / and g over K if d\f and d\g and further, whenever e\f and e\g, we have 
e\d. 

Note that we have said a highest common factor rather than the highest common 
factor. This is because hcf’s need not be unique. However, the next lemma shows that 
they are unique apart from constant factors. 

Lemma 3.5. Ifd is an hcf of the polynomials f and g over K, and if 0 k € K, then 
kd is also an hcf for f and g. 

If d and e are two hcf’s for f and g, then there exists a non-zero element k € K 
such that e = kd. 

Proof Clearly kd\f and kd\g. If e\f and e\g then e\d so that e\kd. Hence kd is an 
hcf. 

If d and e are hcf’s then by definition e\d and d\e. Thus e = kd for some polyno¬ 
mial k. Since e\d the degree of e is less than or equal to the degree of d, so k must 
have degree < 0. Therefore A; is a constant, and so belongs to K. Since 0 f e = kd, 
we must have kf 0. □ 

We shall prove that any two non-zero polynomials have an hcf by providing a 
method to calculate one. This method is a generalisation of the technique used by 
Euclid (Elements Book 7 Proposition 2) around 600 BC for calculating hcf’s of inte¬ 
gers, and is accordingly known as the Euclidean Algorithm. 
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Algorithm 3.6 (Euclidean Algorithm). Ingredients Two polynomials / and g over 
K , both non-zero. 

Recipe For notational convenience let / = r_i, g = ro. Use the Division Algo¬ 
rithm to find successively polynomials qj and r\ such that 

r -1 =qir 0 + n 
rt) = qir\ +r 2 
n =q3r 2 + r 3 

n = qt+in+i + n+2 


dr\ < dro 
dr 2 < dr\ 
dr$ < dr 2 


(3.1) 


Since the degrees of the r, decrease, we must eventually reach a point where the 
process stops; this can happen only if some r s+2 = 0. The last equation in the list 
then reads 


r s = qs+ir s +i 


(3.2) 


and it provides the answer we seek: 

Theorem 3.7. With the above notation, r i+ i is an hcffor f and g. 

Proof. First we show that r s+ i divides both / and g. We use descending induction 
to show that r i+ i|r,- for all i. Clearly r s+ i|r s+ i. Equation (3.2) shows that r s+ ] \r s . 
Equation (3.1) implies that if r s+ i\rj +2 and r s+ ] |r,- + i then r s+ i|r,-. Hence r 1+ i|r,- for 
all i; in particular r s+ i |ro = g and r 1+ i |r_i = /. 

Now suppose that e\f and e\g. By (3.1) and induction, e\r; for all i. In particular, 
e\r s+ \. Therefore r 1+ i is an hcf for / and g, as claimed. □ 

Example 3.8. Let / = t 4 +2f 3 +2f 2 +2f + 1, g = t 2 — 1 over Q. We compute an hcf 
as follows: 

t 4 +2t 3 +2t 2 + 2t + \ = (t 2 + 2f + 3)(f 2 -l)+4f + 4 
t 2 ~\ = (4t+4)( l -t- 1 -) 

Hence 4f +4 is an hcf. So is any rational multiple of it, in particular, t + 1. 

We end this section by deducing from the Euclidean Algorithm an important 
property of the hcf of two polynomials. 

Theorem 3.9. Let f and g be non-zero polynomials over K, and let d be an hcffor 
f and g. Then there exist polynomials a and b over K such that 

d = af + bg 

Proof. Since hcf’s are unique up to constant factors we may assume that d = r s V \ 
where equations (3.1) and (3.2) hold. We claim as induction hypothesis that there 
exist polynomials a, and b, such that 


d = ain + bin+i 
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This is clearly true when i = s + 1, for we may then take a, = 1, /?, = 0. By (3.1) 

n +1 = n-i-qi+m 

Hence by induction 

d = ain + bi(n^i-q i+l n) 

so that if we put 

at -1 = bi bj -1 = a { - b t q t+ ] 

we have 

d = a ,-1 r ,-_1 

Hence by descending induction 

c/ = a_ir_i +b-iro = af + bg 

where a = a_ i, b = b- 1 . This completes the proof. □ 

The induction step above affords a practical method of calculating a and b in any 
particular case. 


3.2 Irreducibility 

Now we investigate the analogue, for polynomials, of prime numbers. The con¬ 
cept required is ‘irreducibility’. In particular, we prove that every polynomial over a 
subring of C can be expressed as a product of irreducibles in an ‘essentially’ unique 
way. 

An integer is prime if it cannot be expressed as a product of smaller integers. The 
analogue for polynomials is similar: we interpret ‘smaller’ as ‘smaller degree’. So 
the following definition yields the polynomial analogue of a prime number. 

Definition 3.10. A non-constant polynomial over a subring R of C is reducible if it 
is a product of two polynomials over R of smaller degree. Otherwise it is irreducible. 

Examples 3.11. (1) All polynomials of degree 1 are irreducible, since they certainly 
cannot be expressed as a product of polynomials of smaller degree. 

(2) The polynomial t 2 — 2 is irreducible over Q. To show this we suppose, for a 
contradiction, that it is reducible. Then 

t 2 — 2 = (at + b) (ct + d ) 

where a,b,c,d,G Q. Dividing out if necessary we may assume a = c 1. Then 
b + d = 0 and bd = —2, so that b 2 = 2. But no rational number has its square equal 
to 2 (Exercise 1.2). 
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(3) However, r 2 — 2 is reducible over the larger subfield R, for now 

t 2 -2 = (t-V2)(t + V2) 

This shows that an irreducible polynomial may become reducible over a larger sub- 
field of C. 

(4) The polynomial 6f + 3 is irreducible in Z[t], Although it has factors 

6f + 3 = 3(2f + l) 

the degree of 2f + 1 is the same as that of 6t+6. So this factorisation does not count. 

(5) The constant polynomial 6 is irreducible in Z[t], Again, 6 = 2-3 does not count. 

Any reducible polynomial can be written as the product of two polynomials of 
smaller degree. If either of these is reducible it too can be split up into factors of 
smaller degree ... and so on. This process must terminate since the degrees cannot 
decrease indefinitely. This is the idea behind the proof of: 

Theorem 3.12. Any non-zero polynomial over a subring R of C is a product of 
irreducible polynomials over R. 

Proof. Let g be any non-zero polynomial over R. We proceed by induction on the 
degree of g. If dg = 0 or 1 then g is automatically irreducible. If dg > 1, then either 
g is irreducible or g = hk where dh.dk < dg. By induction, h and k are products 
of irreducible polynomials, whence g is such a product. The theorem follows by 
induction. □ 

Example 3.13. We can use Theorem 3.12 to prove irreducibility in some cases, 
especially for cubic polynomials over Z. For instance, let R = Z. The polynomial 

f(t)=t 3 -5t + l 

is irreducible. If not, then it must have a linear factor t — a over Z, and then a € Z 
and f(a) = 0. Moreover, there must exist /3,ye Z such that 

fit) = (t - a)(t 2 + fit + y) 

= f 3 + (j3 - a)t 2 + (y- afi)t - ay 

so in particular ay = —1. Therefore a = ±1. But /(1) = —3^0 and /(—1) =5^0. 
Therefore no such factor exists. 

Irreducible polynomials are analogous to prime numbers. The importance of 
prime numbers in Z stems in part from the possibility of factorising every integer 
into primes, but even more so from the uniqueness (up to order) of the prime factors. 
Likewise the importance of irreducible polynomials depends upon a uniqueness the¬ 
orem. Uniqueness of factorisation is not obvious, see Stewart and Tall (2002) Chapter 
4. In certain cases it is possible to express every element as a product of irreducible 
elements, without this expression being in any way unique. We shall heed the warn¬ 
ing and prove the uniqueness of factorisation for polynomials. To avoid technical 
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issues like those in Examples 3.1(4,5), we restrict attention to polynomials over a 
sub field K of C. It is possible to prove more general theorems by introducing the 
idea of a ‘unique factorisation domain’, see Fraleigh (1989) Chapter 6. 

For convenience we make the following: 

Definition 3.14. If / and g are polynomials over a subfield K of C with hcf equal to 
1, we say that / and g are coprime , or / is prime to g. (The common phrase ‘coprime 
to’ is wrong. The prefix ‘co’ and the ‘to’ say the same thing, so it is redundant to use 
both.) 


The key to unique factorisation is a statement analogous to an important property 
of primes in Z, and is used in the same way: 


Lemma 3.15. Let K be a subfield ofC, f an irreducible polynomial over K, and g. h 
polynomials over K. If f divides gh, then either f divides g or f divides h. 


Proof. Suppose that f\g. We claim that / and g are coprime. For if d is an hcf for / 
and g, then since / is irreducible and d\f, either d = kf for some k € K, or d = k € K. 
In the first case f\g, contrary to hypothesis. In the second case, 1 is also an hcf for / 
and g, so they are coprime. By Theorem 3.9, there exist polynomials a and b over K 
such that 

1 =af + bg 


Then 


h = haf + hbg 


Now f\haf, and f\hbg since f\gh. Hence f\h. This completes the proof. 


□ 


We may now prove the uniqueness theorem. 

Theorem 3.16. For any subfield K of C, factorisation of polynomials over K into 
irreducible polynomials is unique up to constant factors and the order in which the 
factors are written. 

Proof. Suppose that / = /t • • -f r = gi ■ ■ -gs where / is a polynomial over K and 
f] .... ,f r . g \.... ,g s are irreducible polynomials over K. If all the f are constant then 
/ € K. so all the gj are constant. Otherwise we may assume that no / is constant, 
by dividing out all of the constant terms. Then f\ |gi.. ,g s . By an obvious induction 
based on Lemma 3.15, f\\gj for some j. We can choose notation so that j = 1, and 
then f\\g\. Since f\ and g\ are irreducible and f\ is not a constant, we must have 
/l = kig l for some constant k\. Similarly /2 = kigi, • ••,/• = k r g r where k 2 ,...,k r 
are constant. The remaining gfil > r) must also be constant, or else the degree of the 
right-hand side would be too large. The theorem is proved. □ 
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3.3 Gauss’s Lemma 

It is in general very difficult to decide—without using computer algebra, at any 
rate—whether a given polynomial is irreducible. As an example, think about 

t 16 + t 15 +t 14 +t 13 + t 12 + t U +t 10 + t 9 + t s + t 7 +t 6 +t 5 +t 4 + t 3 + t 2 + t + 1 (3.3) 

This is not an idle example: we shall be considering precisely this polynomial in 
Chapter 20, in connection with the regular 17-gon, and its irreducibility (or not) will 
be crucial. 

To test for irreducibility by trying all possible factors is usually futile. Indeed, 
at first sight there are infinitely many potential factors to try, although with suitable 
short cuts the possibilities can be reduced to a finite—usually unfeasibly large— 
number. In principle the resulting method can be applied to polynomials over Q, for 
example: see van der Waerden (1953), Garling (1960). But the method is not really 
practicable. 

Instead, we have to invent a few useful tricks. In the next two sections we describe 
two of them: Eisenstein’s Criterion and reduction modulo a prime. Both tricks apply 
in the first instance to polynomials over Z. However, we now prove that irreducibility 
over Z is equivalent to irreducibility over Q. This extremely useful result was proved 
by Gauss, and we use it repeatedly. 

Lemma 3.17 (Gauss’s Lemma). Let f be a polynomial over Z that is irreducible 
over Z. Then f, considered as a polynomial over Q, is also irreducible over Q. 

Proof. The point of this lemma is that when we extend the subring of coefficients 
from Z to Q, there are hosts of new polynomials which, perhaps, might be factors of 
/. We show that in fact they are not. For a contradiction, suppose that / is irreducible 
over Z but reducible over Q, so that / = gh where g and h are polynomials over Q, 
of smaller degree, and seek a contradiction. Multiplying through by the product of 
the denominators of the coefficients of g and h, we can rewrite this equation in the 
form nf = g'h', where n £ Z and g', h' are polynomials over Z. We now show that 
we can cancel out the prime factors of n one by one, without going outside Z[t\. 
Suppose that p is a prime factor of n. We claim that if 

g' — go + g 1 1 H-+ gi-t' h' = ho + hit 4-+ h s t s 

then either p divides all the coefficients g,. or else p divides all the coefficients hj. 
If not, there must be smallest values i and j such that p\g, and p\h;. However, p 
divides the coefficient of t' + ! in g'h!, which is 


hogi+j + hgi+j- 1 H-b hjgi H-b hi+jgo 


and by the choice of i and j, the prime p divides every term of this expression except 
perhaps hjgi. Butp divides the whole expression, so p\hjgi. However, p\hj and p\g t , 
a contradiction. This establishes the claim. 
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Without loss of generality, we may assume that p divides every coefficient g,. 
Then g' = pg" where g" is a polynomial over Z of the same degree as g' (or g). Let 
n = pn\. Then pn\f = pg"h', so that n\f = g"W . Proceeding in this way we can 
remove all the prime factors of n, arriving at an equation / = gh. Here g and h are 
polynomials over Z, which are rational multiples of the original g and h, so dg = dg 
and dh = dh. But this contradicts the irreducibility of / over Z, so the lemma is 
proved. □ 

Corollary 3.18. Let f £ Z[f] and suppose that over Q[f] there is a factorisation into 
irreducibles: 

f 81 ■•• 8 s 

Then there exist at £ Q such that ajgj £ Z[f] and a\.. .a s = 1. Furthermore, 

f={a 1 g 1 )...(a s g s ) 

is a factorisation of f into irreducibles in Z[t\. 

Proof Factorise / into irreducibles over Z[f], obtaining / = h\ ...h r . By Gauss’s 
Lemma, each hj is irreducible over Q. By uniqueness of factorisation in Q[f], we 
must have r = s and h ; = ajgj for c/ ; £ Q. Clearly a\ ... a s = 1. The Corollary is now 
proved. □ 


3.4 Eisenstein’s Criterion 

No, not ‘Einstein’. Ferdinand Gotthold Eisenstein was a student of Gauss, and 
greatly impressed his tutor. We can apply the tutor’s lemma to prove the student’s 
criterion for irreducibility: 

Theorem 3.19 (Eisenstein’s Criterion). Let 

f(t) = (3o + a\t + • • • + a n t n 

be a polynomial over Z. Suppose that there is a prime q such that 

(1) q\a n 

(2) q\at (i = 0,...,n— 1) 

(3) q 2 \a 0 

Then f is irreducible over Q. 

Proof By Gauss’s Lemma it is sufficient to show that / is irreducible over Z. Sup¬ 
pose for a contradiction that / = gh, where 


g = bo + bit -h b r t 


h = co +cd H-f- c s t 
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are polynomials of smaller degree over Z. Then r > 1 , v > 1, and r + s = n. Now 
f>oco = no so by (2) </£>o or q\c$. By (3) q cannot divide both bo and cq, so without 
loss of generality we can assume q\bo , q\co- If all bj are divisible by q , then a n is 
divisible by q , contrary to (1). Let bj be the first coefficient of g not divisible by q. 
Then 

aj = bjCQ H-1- boCj 

where j < n. This implies that q divides co, since q divides aj,bo,.. .,bj- 1 , but not 
bj. This is a contradiction. Hence / is irreducible. □ 

Example 3.20. Consider 


f(t ) = 11 5 + ff 4 + f 3 + i over Q 
This is irreducible over Q if and only if 

9/(1) =2f 5 + 15f 4 + 9f 3 + 3 

is irreducible over Q. Eisenstein’s criterion now applies with q = 3, showing that / 
is irreducible. 

We now turn to the polynomial (3.3). This provides an instructive example that 
leads to a useful general result. In preparation, we prove a standard number-theoretic 
property of binomial coefficients: 

Lemma 3.21. If p is prime, the binomial coefficient 

P 
r 


is divisible by p if 1 < r < p — 1. 

Proof. The binomial coefficient is an integer, and 

(p) = _pL_ 

\r J rl(p — r)\ 

The factor p in the numerator cannot cancel with any factor in the denominator unless 
r = 0 or r = p. □ 

We then have: 

Lemma 3.22. If p is a prime then the polynomial 

ff) = l + t + ... +t P~ l 


is irreducible over Q. 



Reduction Modulo p 


57 


Proof. Note that fit) = (t p — 1) /(t — 1). Put t = 1 + u where u is a new indeterminate. 
Then f(t) is irreducible over Q if and only if /(I + u) is irreducible. But 


/( 1+m) 


(1 + u) p — 1 
u 


= u p 1 + ph(u) 


where h is a polynomial in u over Z with constant term 1, by Lemma 3.21. By Eisen- 
stein’s Criterion, Theorem 3.19, /(I +u) is irreducible over Q. Hence f{t) is irre¬ 
ducible over Q. □ 

Setting p = 17 shows that the polynomial (3.3) is irreducible over Q. 


3.5 Reduction Modulo p 

A second trick to prove irreducibility of polynomials in Z\t] involves ‘reducing’ 
the polynomial modulo a prime integer p. 

Recall that if n £ Z, two integers a,b are congruent modulo n, written 

a = b (mod n) 

if a — b is divisible by n. The number n is the modulus, and ‘modulo’ is Latin for 
‘to the modulus’. Congruence modulo n is an equivalence relation, and the set of 
equivalence classes is denoted by Z„. Arithmetic in Z„ is just like arithmetic in Z, 
except that n = 0 . 

The test for irreducibility that we now wish to discuss is most easily explained 
by an example. The idea is this. There is a natural map Z —> Z„ in which each m £ Z 
maps to its congruence class modulo n. The natural map extends in an obvious way to 
a map Z[f] —» Z„ [f], Now a reducible polynomial over Z is a product gh of polynomi¬ 
als of lower degree, and this factorisation is preserved by the map. Provided n does 
not divide the highest coefficient of the given polynomial, the image is reducible 
over Z„. So if the image of a polynomial is irreducible over Z„, then the original 
polynomial must be irreducible over Z. (The corresponding statement for reducible 
polynomials is in general false: consider t 2 — 2 £ Z[f] when p = 2.) Since Z„ is finite, 
there are only finitely many possibilities to check when deciding irreducibility. 

In practice, the trick is to choose the right value for n. 

Example 3.23. Consider 

/(f) = f 4 + 15f 3 + 7 over Z 

Over Z 5 this becomes f 4 + 2. If this is reducible over Z 5 , then either it has a factor of 
degree 1, or it is a product of two factors of degree 2. The first possibility gives rise 
to an element x £ Z 5 such that x 4 + 2 = 0. No such element exists (there are only five 



58 


Factorisation of Polynomials 



FIGURE 9: Multiple zeros of a (real) polynomial. The multiplicity is 1 at (A), 2 at 
(B), and 3 at (C). 


elements to check) so this case is ruled out. In the remaining case we have, without 
loss of generality, 

t 4 T 2 — (f ” T at T If) (t 2 T ct T d') 

Therefore a + c = O.ac + b + d = 0, ad + be = O.bd = 2. Combining ad + bc = 0 with 
a + c = 0 we get a(b — d) =0. So either a = 0 or b = d. 

If a = 0 then c = 0, so b + d = 0 ,bd = 2. That is, b 2 = —2 = 3 in Z 5 . But this is 
not possible. 

If b = d then b 2 = 2, also impossible in Z 5 . 

Hence t 4 + 2 is irreducible over Z 5 , and therefore the original f(t) is irreducible 
over Z, hence over Q. 

Notice that if instead we try to work in Z 3 , then f(t) becomes t 4 + 1, which 
equals (t 2 +1 — 1) (t 2 — t — 1) and so is reducible. Thus working (mod 3) fails to 
prove irreducibility. 


3.6 Zeros of Polynomials 

We have already studied the zeros of a polynomial over C. It will be useful to 
employ similar terminology for polynomials over a subring R of C, because then we 
can keep track of where the zeros lie. We begin with a formal definition. 

Definition 3.24. Let R be a subring of C, and let / be a polynomial over R. An 
element a € R such that f(a) = 0 is a zero of f in R. 

To illustrate some basic phenomena associated with zeros, we consider polyno¬ 
mials over the real numbers. In this case, we can draw the graph y = f(x) (in standard 
terminology, with x € R in place of t). The graph might, for example, resemble Fig¬ 
ure 9. 

The zeros of / are the values of x at which the curve crosses the x-axis. Consider 
the three zeros marked A, B, C in the diagram. At A the curve cuts straight through 
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the axis; at B it ‘bounces’ off it; at C it ‘slides’ through horizontally. These phenom¬ 
ena are generally distinguished by saying that B and C are ‘multiple zeros’ of /(f). 
The single zero B must be thought of as two equal zeros (or more) and C as three (or 
more). 

But if they are equal, how can there be two of them? The answer is the concept of 
‘multiplicity’ of a zero, introduced in Section 2.3. We now reformulate this concept 
without using the Fundamental Theorem of Algebra, which in this context is the 
proverbial nut-cracking sledgehammer. The key is to look at linear factors of /. 

Lemma 3.25. Let f be a polynomial over the subfield K ofC. An element a £ K is 
a zero of f if and only if (f — a)\f(t) in K[t ]. 

Proof We know that (f — a)\ f(t) in C[f] by Theorem 2.5, but we want slightly more. 
If (f — a) |/(f) in K[t], then /(f) = (f — Ot,)g(t) for some polynomial g over K , so that 
f(a) = {a-a)g(a) = 0. 

Conversely, suppose /(a) = 0. By the Division Algorithm, there exist polynomi¬ 
als q,r £ K[t] such that 

/(f) = (f - a)q(t) + r(f) 
where dr < 1. Thus r(t) = r £ K. Substituting a for f, 

0 = f(a) = (a - a)q(a) + r 

so r = 0. Hence (f — a) |/(f) £ K [f] as required. □ 

We can now say what we mean by a multiple zero, without appealing to the 
Fundamental Theorem of Algebra. 

Definition 3.26. Let / be a polynomial over the subfield K of C. An element a £ K 
is a simple zero of / if (f — a)|/(f) but (f — a ) 2 j/(f). The element a is a zero of / 
of multiplicity m if (f — a) m \ f(t) but (f — a) m+l \f{t). Zeros of multiplicity greater 
than 1 are repeated or multiple zeros. 

For example, f 3 — 3f+2 over Q has zeros at a = 1,-2. It factorises as (f — l) 2 (f + 
2). Hence —2 is a simple zero, while 1 is a zero of multiplicity 2. 

When K =R and we draw a graph, as in Figure 9, points like A are the simple 
zeros; points like B are zeros of even multiplicity; and points like C are zeros of 
odd multiplicity > 1. For subfields of C other than R (except perhaps Q, or other 
subfields of R) a graph has no evident meaning, but the simple geometric picture for 
R is often helpful. 

Lemma 3.27. Let f be a non-zero polynomial over the subfield K ofC, and let its 
distinct zeros be (Xi,..., CC r with multiplicities mi,..., m r respectively. Then 

f(t) = (t-ai) mi ...(t-a r ) m '-g(t) (3.4) 

where g has no zeros in K. 

Conversely, if ( 3.4) holds and g has no zeros in K, then the zeros of f in K are 
(X\, ..., a,, with multiplicities mi,..., m r respectively. 
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Proof. For any a GK the polynomial t — a is irreducible. Hence for distinct CX.fi GK 
the polynomials t — a and t p are coprime in K[t}. By uniqueness of factorisation 
(Theorem 3.16) equation (3.4) must hold. Moreover, g cannot have any zeros in K. 
or else / would have extra zeros or zeros of larger multiplicity. 

The converse follows easily from uniqueness of factorisation. Theorem 3.12 and 
Theorem 3.16. □ 

From this lemma we deduce a famous theorem: 

Theorem 3.28. The number of zeros of a nonzero polynomial over a subfield of C, 
counted according to multiplicity, is less than or equal to its degree. 

Proof. In equation (3.4) we must have m\ 4-h m r <df. □ 


EXERCISES 

3.1 For the following pairs of polynomials / and g over Q, find the quotient and 
remainder on dividing g by /. 

(a) g = t 1 -t 3 + 5,f = t 3 +l 

(b) g = t 2 + \J = t 2 

(c) g = 4f 3 — lit 2 +f — 3, / = 2t + 5 

(d) g = t 4 -\J = t 2 + \ 

(e) g = t 4 -ff = 3t 2 + 3t 

3.2 Find hcf’s for these pairs of polynomials, and check that your results are com¬ 
mon factors of / and g. 

3.3 Express these hcf’s in the form af + bg. 

3.4 Decide the irreducibility or otherwise of the following polynomials: 

(a) t 4 + 1 over R. 

(b) t 4 + 1 over Q. 

(c) t 7 + 1 It 3 — 33t + 22 over Q. 

(d) t 4 +1 3 + 1 2 +1 + 1 over Q. 

(e) f 3 — It 2 + 3t + 3 over Q. 

3.5 Decide the irreducibility or otherwise of the following polynomials: 

(a) t 4 + f 3 +t 2 +t + 1 over Q. (Hint: Substitute t + 1 in place of t and appeal 
to Eisenstein’s Criterion.) 

(b) t 5 + 1 4 +1 3 + 1 2 +1 + 1 over Q. 
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(c) t' 6 +1 5 + 1 4 + f 3 + 1 2 +1 + 1 over Q. 

3.6 In each of the above cases, factorise the polynomial into irreducibles. 

3.7 Say that a polynomial / over a subfield K of C is prime if whenever f\gh 
either f\g or f\h. Show that a polynomial f ^ 0 is prime if and only if it is 
irreducible. 

3.8 Find the zeros of the following polynomials; first over Q, then R, then C. 

(a) t 3 +1 

(b) t 3 -6t 2 + llt-6 

(c) f 5 +f + l 

(d) t 2 + 1 

(e) t 4 + t 3 +t 2 +t+ 1 

(f) f 4 — 6f 2 + 11 

3.9 Mark the following true or false. (Here ‘polynomial’ means ‘polynomial over 
C’.) 

(a) Every polynomial of degree n has n distinct zeros. 

(b) Every polynomial of degree n has at most n distinct zeros. 

(c) Every polynomial of degree n has at least n distinct zeros. 

(d) If f,g are non-zero polynomials and / divides g, then df < dg. 

(e) If /, g are non-zero polynomials and / divides g, then df < dg. 

(f) Every polynomial of degree 1 is irreducible. 

(g) Every irreducible polynomial has prime degree. 

(h) If a polynomial / has integer coefficients and is irreducible over Z, then 
it is irreducible over Q. 

(i) If a polynomial / has integer coefficients and is irreducible over Z, then 
it is irreducible over R. 

(j) If a polynomial / has integer coefficients and is irreducible over R, then 
it is irreducible over Z. 
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Chapter 4 


Field Extensions 


Galois’s original theory was couched in terms of polynomials over the complex field. 
The modern approach is a consequence of the methods used, starting around 1890 
and flourishing in the 1920s and 1930s, to generalise the theory to arbitrary fields. 
From this viewpoint the central object of study ceases to be a polynomial, and be¬ 
comes instead a ‘field extension’ related to a polynomial. Every polynomial / over a 
field K defines another field L containing K (or at any rate a subfield isomorphic to 
K). There are conceptual advantages in setting up the theory from this point of view. 
In this chapter we define field extensions (always working inside C) and explain the 
link with polynomials. 


4.1 Field Extensions 

Suppose that we wish to study the quartic polynomial 

f(t)=t 4 -4t 2 -5 


over Q. Its irreducible factorisation over Q is 

m = (t 2 +l)(t 2 -5) 

so the zeros of / in C are ±i and ±\/5. There is a natural sub field L of C associated 
with these zeros; in fact, it is the unique smallest subfield that contains them. We 
claim that L consists of all complex numbers of the form 

p + qi + rV5+siV5 (p,q,r,s£ Q) 

Clearly L must contain every such element, and it is not hard to see that sums and 
products of such elements have the same form. It is harder to see that inverses of 
(non-zero) such elements also have the same form, but it is true: we postpone the 
proof to Example 4.8. Thus the study of a polynomial over Q leads us to consider a 
subfield L of C that contains Q. In the same way the study of a polynomial over an 
arbitrary subfield K of C will lead to a subfield L of C that contains K. We shall call 
L an ‘extension’ of K. For technical reasons this definition is too restrictive; we wish 
to allow cases where L contains a subfield isomorphic to K, but not necessarily equal 
to it. 


63 



64 


Field Extensions 


Definition 4.1. A field extension is a monomorphism i : K —> L, where K and L are 
subfields of C. We say that K is the small field and L is the large field. 

Notice that with a strict set-theoretic definition of function, the map i determines 
both K and L. See Definition 1.3 for the definition of ‘monomorphism’. We often 
think of a field extension as being a pair of fields {K,L), when it is clear which 
monomorphism is intended. 

Examples 4.2. 1. The inclusion maps ii : Q —> R, 12 : R. —> C, and 13 : Q —> C are all 
field extensions. 

2. Let K be the set of all real numbers of the form p + q\J 2, where p.q € Q. Then K 
is a subfield of C by Example 1.7. The inclusion map 1 : Q —>■ K is a field extension. 

If 1 : K —> L is a field extension, then we can usually identify K with its image 
l(K), so that 1 can be thought of as an inclusion map and K can be thought of as a 
subfield of L. Under these circumstances we use the notation 

L-.K 

for the extension, and say that L is an extension of K. In future we shall identify K 
and 1 ( K ) whenever this is legitimate. 

The next concept is one which pervades much of abstract algebra: 

Definition 4.3. Let X be a subset of C. Then the subfield of C generated by X is the 
intersection of all subfields of C that contain X. 

It is easy to see that this definition is equivalent to either of the following: 

1. The (unique) smallest subfield of C that contains X. 

2. The set of all elements of C that can be obtained from elements of X by a finite 
sequence of field operations, provided X / {0} or 0. 

Proposition 4.4. Every subfield of C contains Q. 

Proof Let /tCCbe a subfield. Then 0,1 £ K by definition, so inductively we find 
that 1 +... + 1 = n lies in K for every integer n > 0. Now K is closed under additive 
inverses, so —n also lies in K , proving that Z C K. Finally, if p,q £ Z and q f 0, 
closure under products and multiplicative inverses shows that pq 1 £ K. Therefore 
Q C K as claimed. □ 

Corollary 4.5. Let X be a subset of C. Then the subfield of C generated by X con¬ 
tains Q. 

Because of Corollary 4.5, we use the notation 

QPO 


for the subfield of C generated by X. 
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Example 4.6. We find the subfield K of C generated by X = {1,/}. By Proposi¬ 
tion 4.4, K must contain Q. Since K is closed under the arithmetical operations, it 
must contain all complex numbers of the form p + qi , where p,q £ Q. Let M be the 
set of all such numbers. We claim that M is a subfield of C. Clearly M is closed under 
sums, differences, and products. Further 

t , -\-i P V ■ 

(p + qi) = , , - , + 

p~ + q~ p~+ q 

so that every non-zero element of M has a multiplicative inverse in M. Hence M is 
a subfield, and contains X. Since K is the smallest subfield containing X, we have 
K CM. But M C K by definition. Hence K =M, and we have found a description of 
the subfield generated by X. 

In the case of a field extension L : K we are mainly interested in subfields lying 
between K and L. This means that we can restrict attention to subsets X that contain 
K; equivalently, to sets of the form K U Y where Y CL. 

Definition 4.7. If L : K is a field extension and Y is a subset of L. then the subfield of 
C generated by K U Y is written K(Y ) and is said to be obtained from K by adjoining 
Y. 


Clearly K(Y) CL since L is a subfield of C. Notice that K(Y) is in general con¬ 
siderably larger than KUY. 

This notation is open to all sorts of useful abuses. If Y has a single element y 
we write K(y) instead of /C({y}), and in the same spirit K(yi,...,y n ) will replace 

K{{yi,...,y n })- 

Example 4.8. Let K = Q and let Y = {/, x/5}. Then K( Y) must contain K and Y. 
It also contains the product is/ 5. Since K D Q, the subfield K(Y) must contain all 
elements 

a = p + qi + r\J 5 + si sfl (p,q,r,s G Q). 

Let ICCbe the set of all such a. If we prove that L is a subfield of C, then it follows 
that K(Y) = L. Moreover, it is easy to check that L is a subring of C, hence L is a 
subfield of C if and only if for a^Owe can find an inverse OC 1 £ L. If fact, we shall 
prove that if (p,q,r,s) / (0,0,0,0) then a / 0, and then 

(p + qi + rs/5 + sis/5)- 1 £ L 

First, suppose that p + qi + r\/5 + sis /5 = 0. Then 

p + rs /5 = —i(q + ss/ 5) 

Now both p+rs /5 and — (q+ss/5) are real, but i is imaginary. Therefore p + rs /5 = 0 
and q + ss/5 =0. If r / 0 then s/5 = —p/r£ Q, but s/5 is irrational. Therefore r = 0, 
whence p = 0. Similarly, q = s = 0. 

Now we prove the existence of a -1 in two stages. Let M be the subset of L 
containing all p + qi (p,q £ Q). Then we can write 
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where x = p + iq and y = r + is G M. Let 

P = p + qi— rs/S — sis/5 =x — ys/ 5 € L 

Then 

ap = (x+yV5)(x — yV5) =x 2 — 5 y 2 = z 

say, where zGM. Since a / O and /3 ^ 0 we have z ^ 0, so a -1 = /3z _1 . Now write 
Z = u + vi (u,v G Q) and consider w = u — vi. Since zw = ir + v 2 G Q we have 

z~‘ = (u 2 +v 2 ) _1 w e At 

so a = /ItT 1 G L. 

Alternatively, we can obtain an explicit formula by working out the expression 

(p + qi + rV5 +siV5)(p — qi + rV5 — sis/5) 
x (p + qi — rV5 — siV5)(p — qi — rV5 +si\/5) 

and showing that it belongs to Q, and then dividing out by 

(p + qi + rV 5 + si V5) 


See Exercise 4.6. 

Examples 4.9. (1) The subfield R(i) of C must contain all elements x + iy where 
x,y G M. But those elements comprise the whole of C. Therefore C = R(i). 

(2) The subfield P of R consisting of all numbers p + q\J 2 where p,q G Q is easily 
seen to equal Q(v^). 

(3) It is not always true that a subfield of the form Kia ) consists of all elements of 
the form j + ka where j. k G K. It certainly contains all such elements, but they need 
not form a subfield. 

For example, in R : Q let a be the real cube root of 2, and consider Q(a). As well 
as a, the subfield Q(a) must contain a 2 . We show that a 2 ^ j + ka for j,k G Q. For 
a contradiction, suppose that a 2 = j+ka. Then2 = a 3 = ja+ka 2 = jk+(j+k 2 )a. 
Therefore (j+k 2 )a = 2 — jk. Since a is irrational, (j + k 2 ) = 0 = 2 — jk. Eliminating 
j , we find that k 2 = 2, contrary to k G Q. 

In fact, Q(a) is precisely the set of all elements of R of the form p + qa + ra 2 , 
where p,q , r G Q. To show this, we prove that the set of such elements is a subfield. 
The only (minor) difficulty is finding a multiplicative inverse: see Exercise 4.7. 


4.2 Rational Expressions 

We can perform the operations of addition, subtraction, and multiplication in the 
polynomial ring C[f], but (usually) not division. For example, C[t] does not contain 
an inverse t~ l for t. see Exercise 4.8. 
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However, we can enlarge C[f] to provide inverses in a natural way. We have seen 
that we can think of polynomials f(t) £ C[f] as functions from C to itself. Similarly, 
we can think of fractions p{t)/q{t) £ C (t) as functions. These are called rational 
functions of the complex variable t , and their formal statements in terms of poly¬ 
nomials are rational expressions in the indeterminate t. However, there is now a 
technical difficulty. The domain of such a function is not the whole of C: all of the 
zeros of q(t) have to be removed, or else we are trying to divide by zero. Complex 
analysts often work in the Riemann sphere C U{°°}, and cheerfully let 1/°° = 0, but 
care must be exercised if this is done; the civilised way to proceed is to remove all 
the potential troublemakers. So we take the domain of p(t)/q(t) to be 

{z G C : q{z) ± 0} 

As we have seen, any complex polynomial q has only finitely many zeros, so the 
domain here is ‘almost all’ of C. We have to be careful, but we shouldn’t get into 
much trouble provided we are. 

In the same manner we can also construct the set 

C{tii... ftn) 

of all rational functions in n variables (rational expressions in n indeterminates). One 
use of such functions is to specify the sub field generated by a given set X. It is 
straightforward to prove that Q(X) consists of all rational expressions 

p(Cti,..., GC n ) 

• • • j fin) 


for all n , where p,q £ the a ; - and j3/ belong to A, and q(fi\ ,..., fi„) ^ 0. 

See Exercise 4.9. 

It is also possible to define such expressions without using functions. See ‘field 
of fractions’ in Chapter 16, immediately after Corollary 16.18. This approach is nec¬ 
essary in the more abstract development of the subject. 


4.3 Simple Extensions 

The basic building-blocks for field extensions are those obtained by adjoining 
one element: 

Definition 4.10. A simple extension is a field extension L : K such that L = K(a) for 
some a £ L. 

Examples 4.11. (1) As the notation shows, the extensions in Examples 4.9 are all 
simple. 

(2) Beware: An extension may be simple without appearing to be. Consider L = 
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\/5, — \/5). As written, it appears to require the adjunction of four new elements. 
Clearly just two, i and \/5, suffice. But we claim that in fact only one element is 
needed, because L = L' where L' = Q (i + x^5), which is obviously simple. To prove 
this, it is enough to show that i £ L' and >/5 £ L', because these imply that L CL' 
and L' C L, so L = L'. Now L' contains 

(i + V5) 2 = -l+2iV5 + 5=4 + 2iV5 

Thus it also contains 

(t + V / 5)(4 + 2iv / 5) = 14/-2V5 

Therefore it contains 

14/ —2v / 5 + 2(i + v / 5) = 16/ 

so it contains /. But then it also contains (/ + \/5) — i = V 5. Therefore L = L 1 as 
claimed, and the extension Q(i, — i, a/ 5, — x/5) : Q is in fact simple. 

(3) On the other hand, K : Q is not a simple extension (Exercise 4.5). 

Our aim in the next chapter will be to classify all possible simple extensions. 
We end this chapter by formulating the concept of isomorphism of extensions. In 
Chapter 5 we will develop techniques for constructing all possible simple extensions 
up to isomorphism. 

Definition 4.12. An isomorphism between two field extensions i : K —> K,j : L —>■ L 
is a pair (A, /J.) of field isomorphisms A : K —> L, /i : K —> L, such that for all k £ K 

j{X{k)) = p(i(k)) 

Another, more pictorial, way of putting this is to say that the diagram 

K ^ K 
li ^ ill 
L J L 

commutes', that is, the two paths from K to L compose to give the same map. 

The reason for setting up the definition like this is that as well as the field structure 
being preserved by isomorphism, the embedding of the small field in the large one is 
also preserved. 

Various identifications may be made. If we identify K and i (K), and L and j(L), 
then I and j are inclusions, and the commutativity condition now becomes 

Ii\k = A 

where Li \k denotes the restriction of Ll to K. If we further identify K and L then A 
becomes the identity, and so t /\ K is the identity. In what follows we shall attempt to 
use these ‘identified’ conditions wherever possible. But on a few occasions (notably 
Theorem 9.6) we shall need the full generality of the first definition. 
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EXERCISES 


4.1 Prove that isomorphism of field extensions is an equivalence relation. 

4.2 Find the subfields of C generated by: 

(a) { 0 , 1 } 

(b) { 0 } 

(c) { 0 , 1 ,/} 

(d) {i,V 2 } 

(e) {72,73} 

(f) R 

(g) Ru{i} 

4.3 Describe the subfields of C of the form 


(a) Q(72) 

(b) Q(/) 

(c) Q(a) where a is the real cube root of 2 

(d) Q(V5,V7) 

(e) Q(ivTT) 

(f) Q(e 2 + 1 ) 

(g) Q(T^) 

4.4 This exercise illustrates a technique that we will tacitly assume in several sub¬ 
sequent exercises and examples. 

Prove that 1,72, 73, 76 are linearly independent over Q. 

(Hint: Suppose that p + q\/2 + ry/3 + ss/6 = 0 with p,q,r } s € Q. We may 
suppose that r ^ 0 or s ^ 0 (why?). If so, then we can write y/3 in the form 
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a + b\/l 
c + d\/2 


=e+fV 2 


where u,/ 7 ,c,£/,e,/ € Q. Square both sides and obtain a contradiction.) 


4.5 Show that R is not a simple extension of Q as follows: 


(a) Q is countable. 

(b) Any simple extension of a countable field is countable. 

(c) R is not countable. 

4.6 Find a formula for the inverse of p + qi + r \/5 + si\J 5, where p,q,r,s € Q. 
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4.7 Find a formula for the inverse of p + qa + ra 2 , where p, q, r G Q and a = s/2. 

4.8 Prove that 1 has no multiplicative inverse in C[f], 

4.9 Prove that Q(X) consists of all rational expressions 

p(c% 15 ... ? ) 

q(fi 11 ■ • • ■ fin ■' 

for all n, where p,q £ Q[fi,... the (Xj and /3 ; - belong to X, and 

q(Pi,■■■,&») 7^o. 

4.10 Mark the following true or false. 

(a) If X is the empty set then Q(X) = Q. 

(b) If X is a subset of Q then Q(X) = Q. 

(c) If X contains an irrational number, then Q(X) ^ Q. 

(d) Q(v / 2) = Q- 

(e) Q(V2) = K. 

(f) K(v^) = K. 

(g) Every subfield of C contains Q. 

(h) Every subfield of C contains R. 

(i) If a ^ j3 and both are irrational, then Q(a,/3) is not a simple extension 
' of Q. 



Chapter 5 

Simple Extensions 


The basic building block of field theory is the simple field extension. Here one new 
element a is adjoined to a given subfield K of C, along with all rational expressions in 
that element over K. Any finitely generated extension can be obtained by a sequence 
of simple extensions, so the structure of a simple extension provides vital information 
about all of the extensions that we shall encounter. 

We first classify simple extensions into two very different kinds: transcendental 
and algebraic. If the new element a satisfies a polynomial equation over K, then 
the extension is algebraic; if not, it is transcendental. Up to isomorphism, K has 
exactly one simple transcendental extension. For most fields K there are many more 
possibilities for simple algebraic extensions; they are classified by the irreducible 
polynomials m over K. 

The structure of simple algebraic extensions can be described in terms of the 
polynomial ring K\t\, with operations being performed ‘modulo m’. In Chapter 16 
we generalise this construction using the notion of an ideal. 


5.1 Algebraic and Transcendental Extensions 

Recall that a simple extension of a subfield K of C takes the form K(a) where in 
nontrivial cases a / K. We classify the possible simple extensions for any K. There 
are two distinct types: 

Definition 5.1. Let K be a subfield of C and let a £ C. Then a is algebraic over K 
if there exists a non-zero polynomial p over K such that p(u) = 0. Otherwise, a is 
transcendental over K. 

We shorten ‘algebraic over Q’ to ‘algebraic’, and ‘transcendental over Q’ to 
‘transcendental’. 

Examples 5.2. (1) The number a = \/2 is algebraic, because a 2 — 2 = 0. 

(2) The number a = s/2 is algebraic, because a 3 — 2 = 0. 

(3) The number n = 3 • 14159... is transcendental. We postpone a proof to Chapter 
24. In Chapter 7 we use the transcendence of K to prove the impossibility of ‘squaring 
the circle’. 

(4) The number a = \J% is algebraic over Q(?r), because a 2 — n = 0. 
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(5) However, a = yjn is transcendental over Q. To see why, suppose that p{\pK) — 0 
where 0 / p(t) G Q[f], Separating out terms of odd and even degree, we can write this 
as a(n) + b{n)yfn = 0, so a(n) = —b(n)\/n and a 2 (n) = nb 2 (n). Thus f(n) = 0, 
where 

f(t) = a 2 (t)-tb 2 (t) G Q[f] 

Now d(a 2 ) is even, and d(tb 2 ) is odd, so the difference /(f) is nonzero. But this 
implies that n is algebraic, a contradiction. 

In the next few sections we classify all possible simple extensions and find ways 
to construct them. The transcendental case is very straightforward: if K(t) is the 
set of rational functions of the indeterminate t over K, then K(t) : K is the unique 
simple transcendental extension of K up to isomorphism. If K (a) : K is algebraic, 
the possibilities are richer, but tractable. We show that there is a unique monic irre¬ 
ducible polynomial m over K such that m(a) = 0, and that in determines the exten¬ 
sion uniquely up to isomorphism. 

We begin by constructing a simple transcendental extension of any subfield. 

Theorem 5.3. The set of rational expressions K(t) is a simple transcendental exten¬ 
sion of the subfield K ofC. 

Proof Clearly K(t) : K is a simple extension, generated by t. If p is a polynomial 
over K such that p(t) = 0 then p = 0 by definition of K(t), so the extension is tran¬ 
scendental. □ 


5.2 The Minimal Polynomial 

The construction of simple algebraic extensions is a much more delicate issue. It 
is controlled by a polynomial associated with the generator a of K[a) : K, called the 
‘minimal polynomial’. (An alternative name often encountered is ‘minimum polyno¬ 
mial’.) To define it we first set up a technical definition. 

Definition 5.4. A polynomial /(f) = a o +a\t -h a„t" over a subfield K of C is 

monic if a n = 1. 

Clearly every polynomial is a constant multiple of some monic polynomial, and 
for a non-zero polynomial this monic polynomial is unique. Further, the product of 
two monic polynomials is again monic. 

Now suppose that K(a): K is a simple algebraic extension. There is a polynomial 
p over K such that p(<x) = 0. We may suppose that p is monic. Therefore there exists 
at least one monic polynomial of smallest degree that has a as a zero. We claim that 
p is unique. To see why, suppose that p. q are two such, then p ( a) —q(a) = 0, so if 
p / q then some constant multiple of p — q is a monic polynomial with a as a zero, 
contrary to the definition. Hence there is a unique monic polynomial p of smallest 
degree such that p(a) = 0. We give this a name: 
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Definition 5.5. Let L.K be a field extension, and suppose that aGLis algebraic 
over K. Then the minimal polynomial of a over K is the unique monic polynomial in 
over K of smallest degree such that m(a) =0. 

For example, i £ C is algebraic over R. If we let m[t) = t 2 + 1 then m(i) = 0. 
Clearly m is monic. The only monic polynomials over R of smaller degree are those 
of the form t + r, where r G R, or the constant polynomial 1. But i cannot be a zero of 
any of these, or else we would have i £ EL Hence the minimal polynomial of i over 
Mis t 2 + 1. 

It is natural to ask which polynomials can be minimal. The next lemma provides 
information on this question. 

Lemma 5.6. If a is an algebraic element over the subfield K ofC, then the minimal 
polynomial of a over K is irreducible over K. It divides every polynomial of which 
a is a zero. 

Proof Suppose that the minimal polynomial m of a over K is reducible, so that 
m = fg where / and g are of smaller degree. We may assume / and g are monic. 
Since m(a) = 0 we have f(a)g(a) = 0, so either /(a) = 0 or g(a) = 0. But this 
contradicts the definition of m. Hence m is irreducible over K. 

Now suppose that p is a polynomial over K such that p(<x) = 0. By the Division 
Algorithm, there exist polynomials q and r over K such that p = mq + r and dr < dm. 
Then 0 = p(cc) = 0 + r(cx). If r / 0 then a suitable constant multiple of r is monic, 
which contradicts the definition of m. Therefore r = 0, so m divides p. □ 

Conversely, if K is a subfield of C, then it is easy to show that any irreducible 
polynomial over K can be the mimimum polynomial of an algebraic element over K: 

Theorem 5.7. If K is any subfield ofC and m is any irreducible monic polynomial 
over K, then there exists a GE C, algebraic over K, such that a has minimal polyno¬ 
mial m overK. 

Proof. Let a be any zero of m in C. Then in ( a) =0, so the minimal polynomial / 
of a over K divides in. But in is irreducible over K and both / and m are monic; 
therefore / = m. □ 


5.3 Simple Algebraic Extensions 

Next, we describe the structure of the field extension K{a ) : K when a has min¬ 
imal polynomial m over K. We proceed by analogy with a basic concept of number 
theory. Recall from Section 3.5 that for any positive integer n it is possible to perform 
arithmetic modulo n, and that integers a,b are congruent modulo n, written 

a = b (mod n) 
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if a — b is divisible by n. In the same way, given a polynomial m £ K\t], we can calcu¬ 
late with polynomials modulo m. We say that polynomials a,b £ K[t] are congruent 
modulo m, written 

a = b (mod in) 

if a(t) — b(t) is divisible by m{t) in 

Lemma 5.8. Suppose that a i = a 2 (mod m) and b\ = 1)2 (mod in). Then a\ + b\ = 
ai T b) (mod m), and a\b\ = (mod m). 

Proof. We know that a \ — an am and b\ 1)2 = bin for polynomials a,b £ K[t\. 
Now 

(fli + b \) - (t !2 + bi) = (at - 02 ) + {bi - bi) = (a - b)m 

which proves the first statement. For the product, we need a slightly more elaborate 
argument: 


a\bi~ a 2^2 = a\bi— a\b 2 +a\b 2 ~ 02 b) 
= a\(b\ -b 2 ) +b 2 (ai-a 2 ) 
= (a\b + b2a)m 


□ 

Lemma 5.9. Every polynomial a £ K[t] is congruent modulo m to a unique polyno¬ 
mial of degree < dm. 

Proof. Divide a by m with remainder, so that a = qm + r where q. r £ K\t] and dr < 
dm. Then a — r — qm. so a = r (mod in). To prove uniqueness, suppose that r = s 
(mod m) where dr,ds < dm. Then r — s is divisible by m but has smaller degree than 
m. Therefore r — s = 0, so r = s, proving uniqueness. □ 

We call r the reduced form of a modulo in. Lemma 5.9 shows that we can calcu¬ 
late with polynomials modulo m in terms of their reduced forms. Indeed, the reduced 
form of a + b is the reduced form of a plus the reduced form of b, while the reduced 
form of ab is the remainder, after dividing by in. of the product of the reduced form 
of a and the reduced form of b. 

Slightly more abstractly, we can work with equivalence classes. The relation = 
(mod m) is an equivalence relation on K[t], so it partitions K[t\ into equivalence 
classes. We write [a] for the equivalence class of a € K[t], Clearly 

[a\ = {f £ E[t\ : m\(a - f)} 

The sum and product of [a] and [b\ can be defined as: 

[a] + [ b\ = [ a + b\ [a] [/?] = [ab] 

It is straightforward to show that these operations are well-defined; that is, they do 
not depend on the choice of elements from equivalence classes. Each equivalence 
class contains a unique polynomial of degree less than dm, namely, the reduced form 
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of a. Therefore algebraic computations with equivalence classes are the same as com¬ 
putations with reduced forms, and both are the same as computations in K [f] with the 
added convention that m(t) is identified with 0. In particular, the classes [0] and [1] 
are additive and multiplicative identities respectively. 

We write 

K[t]/(m) 

for the set of equivalence classes of K[t] modulo m. Readers who know about ideals 
in rings will see at once that K[t]/ ( m) is a thin disguise for the quotient ring of K[t] 
by the ideal generated by m , and the equivalence classes are cosets of that ideal, but 
at this stage of the book these concepts are more abstract than we really need. 

A key result is: 

Theorem 5.10. Every nonzero element of K[t]/(m) has a multiplicative inverse in 
K[t\/(m) if and only if m is irreducible in K[t], 

Proof If m is reducible then m = ab where da,db < dm. Then \a]\b\ = [ab] = 
[m] = [0]. Suppose that [a] has an inverse [c], so that [c][a] = [1]. Then [0] = [c][0] = 
[c] [a] [b\ = [1] \b\ = [b\, so m divides b. Since db < dm we must have b = 0, so m = 0, 
contradiction. 

If m is irreducible, let a € K[t\ with [a] [0]; that is, m\a. Therefore a is prime to 

m, so their highest common factor is 1. By Theorem 3.9, there exist h.k € K[t] such 
that ha + km = 1. Then [h][a] + [&][m] = [1], but [m] = [0] so [1] = [/r][a] + [k\[m\ = 
[h] [a] + [£] [0] = [h] [a] + [0] = [h][a]. Thus [h] is the required inverse. □ 

Again, in abstract terminology, what we have proved is that K[t]/(m) is a field 
if and only if m is irreducible in K\t\. See Chapter 17 for a full explanation and 
generalisations. 


5.4 Classifying Simple Extensions 

We now demonstrate that the above methods suffice for the construction of all 
possible simple extensions (up to isomorphism). Again transcendental extensions 
are easily dealt with. 

Theorem 5.11. Every simple transcendental extension K(cx) : K is isomorphic to 
the extension K(t) : K of rational expressions in an indeterminate t over K. The 
isomorphism K(t ) —> K(a) can be chosen to map t to <X, and to be the identity on K. 

Proof. Define a map (f> : K(t) —> K(a) by 

mt)/g(t))=m/ 8 (a) 

If g f 0 then g ( a) / 0 (since a is transcendental) so this definition makes sense. It is 
clearly a homomorphism, and a simple calculation shows that it is a monomorphism. 



76 


Simple Extensions 


It is clearly onto, and so is an isomorphism. Further, <j >\k is the identity, so that (j) 
defines an isomorphism of extensions. Finally, <j>(t ) = a. □ 

The classification for simple algebraic extensions is just as straightforward, but 
more interesting: 

Theorem 5.12. Let K(a ) : K be a simple algebraic extension, and let the minimal 
polynomial of (X over K be m. Then K(oc) : K is isomorphic to K[t\/(m) : K. The 
isomorphism K[t]/(m) —> K(a) can be chosen to map t to a (and to be the identity 
on K). 

Proof. The isomorphism is defined by [/?(?)] K > p(oi), where [p(f)] is the equivalence 
class of p(f) (mod m). This map is well-defined because p(a) = 0 if and only if 
m\p. It is clearly a field monomorphism. It maps t to a, and its restriction to K is the 
identity. □ 

Corollary 5.13. Suppose K( a ) : K and K(f3) : K are simple algebraic extensions, 
such that a and 0 have the same minimal polynomial m over K. Then the two exten¬ 
sions are isomorphic, and the isomorphism of the large fields can be taken to map a 
to [5 (and to be the identity on K). 

Proof Both extensions are isomorphic to K[t\/(m). The isomorphisms concerned 
map t to a and t to p respectively. Call them l.j respectively. Then j i 1 is an iso¬ 
morphism from K(a ) to K(fi ) that is the identity on K and maps a to J3. □ 

Lemma 5.14. Let K(a) : K be a simple algebraic extension, let the minimal poly¬ 
nomial of a over K be m, and let dm = n. Then {1, a, ..., a" -1 } is a basis for K (a) 
over K. 

Proof The theorem is a restatement of Lemma 5.9. □ 

For certain later applications we need a slightly stronger version of Theorem 5.12, 
to cover extensions of isomorphic (rather than identical) fields. Before we can state 
the more general theorem we need the following: 

Definition 5.15. Let i : K —> L be a field monomorphism. Then there is a map i : 
K[t\ L[t], defined by 

i (ko + k\t + •••-(- k„t n ) = i (ko) +1 (k\)t + • • • +1 ( k n )t n 

(ko ,.... k n £ K). It is easy to prove that I is a monomorphism. If i is an isomorphism, 
then so is i. 

The hat is unnecessary, once the statement is clear, and it may be dispensed with. 
So in future we use the same symbol I for the map between subfields of C and for its 
extension to polynomial rings. This should not cause confusion since l(k) = l(k) for 
any k £ K. 
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Theorem 5.16. Suppose that K and L are subfields of C and l : K -y L is an isomor¬ 
phism. Let K(cc),L(f5) be simple algebraic extensions of K and L respectively, such 
that a has minimal polynomial m a (t) over K and j3 has minimal polynomial mpf) 
over L. Suppose further that mp(t) = l(m a (t)). Then there exists an isomorphism 
j : K(a) —* L( j3) such that j\x = l and j(a) = fi. 

Proof. We can summarise the hypotheses in the diagram 

K -t K(a) 

U ij 
L L(f}) 

where j is yet to be determined. Using the reduced form, every element of K (a) is 
of the form p(a) for a polynomial p over K of degree < dm a . Define j(p(a )) = 
(l(p))Q3) where t(p) is defined as above. Everything else follows easily from The¬ 
orem 5.12. □ 

The point of this theorem is that the given map l can be extended to a map j 
between the larger fields. Such extension theorems , saying that under suitable condi¬ 
tions maps between sub-objects can be extended to maps between objects, constitute 
important weapons in the mathematician’s armoury. Using them we can extend our 
knowledge from small structures to large ones in a sequence of simple steps. 

Theorem 5.16 implies that under the given hypotheses the extensions K(a ) : K 
and L(/3): L are isomorphic. This allows us to identify K with L and K(a ) with L(fj ), 
via the maps I and j. 

Theorems 5.7 and 5.12 together give a complete characterisation of simple al¬ 
gebraic extensions in terms of polynomials. To each extension corresponds an irre¬ 
ducible monic polynomial, and given the small field and this polynomial, we can 
reconstruct the extension. 


EXERCISES 

5.1 Is the extension Q(v / 5, V^) simple? If so, why? If not, why not? 

5.2 Find the minimal polynomials over the small field of the following elements 
in the following extensions: 

(a) i in C : Q 

(b) i in C : K 

(c) \/2 in R : Q 

(d) (\/5 + l)/2inC:Q 

(e) (i\/3-l)/2inC : Q 
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5.3 Show that if a has minimal polynomial t 2 — 2 over Q and /3 has minimal 
polynomial t 2 — At + 2 over Q, then the extensions Q(a): Q and Q(j3) : Q are 
isomorphic. 

5.4 For which of the following m(t) and K do there exist extensions K(a.) of K for 
which a has minimal polynomial m(t) ? 

(a) m(t) = t 2 — 4,K = K. 

(b) m(t) = t 2 — 3, K = K. 

(c) m(t ) = t 2 — 3,K = Q 

fd) m(t) =t 7 — 3t 6 +4t 3 —t— l,K = K 

5.5 Let K be any subfield of C and let m(t) be a quadratic polynomial over K 
(dm = 2). Show that all zeros of m(t) lie in an extension K(a) of K where 
a 2 = k € K. Thus allowing ‘square roots’ \/k enables us to solve all quadratic 
equations over K. 

5.6 Construct extensions Q(a): Q where a has the following minimal polynomial 
over Q: 

(a) t 2 — 5 

(b) t 4 +t 3 +t 2 +t + 1 

(c) f 3 +2 

5.7 Is Q(-\/2, a/3, a/S) : Q a simple extension? 

5.8 Suppose that m(t) is irreducible over K , and a has minimal polynomial mil) 
over K. Does m(t) necessarily factorise over K(a ) into linear (degree 1) poly¬ 
nomials? (Hint: Try K = Q 1 . a — the real cube root of 2.) 

5.9 Mark the following true or false. 

(a) Every field has non-trivial extensions. 

(b) Every field has non-trivial algebraic extensions. 

(c) Every simple extension is algebraic. 

(d) Every extension is simple. 

(e) All simple algebraic extensions of a given subfield of C are isomorphic. 

(f) All simple transcendental extensions of a given subfield of C are isomor¬ 
phic. 

(g) Every minimal polynomial is monic. 

(h) Monic polynomials are always irreducible. 

(i) Every polynomial is a constant multiple of an irreducible polynomial. 



Chapter 6 
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A technique which has become very useful in mathematics is that of associating 
with a given structure a different one, of a type better understood. In this chapter 
we exploit the technique by associating with any field extension a vector space. This 
places at our disposal the machinery of linear algebra—a very successful algebraic 
theory—and with its aid we can make considerable progress. The machinery is suf¬ 
ficiently powerful to solve three notorious problems which remained unanswered for 
over two thousand years. We shall discuss these problems in the next chapter, and 
devote the present chapter to developing the theory. 


6.1 Definition of the Degree 

It is not hard to define a vector space structure on a field extension. It already has 
one! More precisely: 

Theorem 6.1. IfL : K is afield extension, then the operations 

(A, «) i-> Am (A € K,u G L ) 

(m, v) i u + v (u, v G L) 

define on L the structure of a vector space over K. 

Proof. The set L is a vector space over K if the two operations just defined satisfy 
the following axioms: 

(1) u + v = v + u for all u,v G L. 

(2) (u T v) + w = u T (v + w) for all u, v, w G L. 

(3) There exists 0 G L such that 0 + u = u for all u G L. 

(4) For any u G L there exists —u G L such that u + (— u) = 0. 

(5) If A G K,u,v G L , then A(m +v) = Am + Av. 

(6) If 1 is the multiplicative identity of K , then \ u = it for all u G L. 
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(7) If A,/i £ K, then A (flu) = (A/x)m for all w £ L. 

Each of these statements follows immediately because K and L are subfields of C 
and K CL. □ 

We know that a vector space V over a subfield K of C (indeed over any field, 
but we’re not supposed to know about those yet) is uniquely determined, up to iso¬ 
morphism, by its dimension. The dimension is the number of elements in a basis—a 
subset of vectors that spans V and is linearly independent over K. The following 
definition is the traditional terminology in the context of field extensions: 

Definition 6.2. The degree [L : K] of a field extension L.K is the dimension of L 
considered as a vector space over K. 

Examples 6.3. (1) The complex numbers C are two-dimensional over the real num¬ 
bers R, because a basis is {1, (}. Hence [C : R] = 2. 

(2) The extension Q(i, -\/5) : Q has degree 4. The elements {1, \/5, i,z\/5} form a 
basis for Q (i, -\/5) over Q, by Example 4.8. 

Isomorphic field extensions obviously have the same degree. 


6.2 The Tower Law 

The next theorem lets us calculate the degree of a complicated extension if we 
know the degrees of certain simpler ones. 

Theorem 6.4 (Short Tower Law). If K. L. M are subfields of C and K C L C M, 
then 

[M : K\ = [M: L\ [L : K} 

Note: For those who are happy with infinite cardinals this formula needs no extra 
explanation; the product on the right is just multiplication of cardinals. For those who 
are not, the formula needs interpretation if any of the degrees involved is infinite. This 
interpretation is the obvious one: if either [M : L] or [L : K] = <=° then [M : K] = 
and ii [M : K]=°° then either [M : L] = °° or [L : K] = °°. 

Proof Let (xi), e / be a basis for L as vector space over K and let {yf)jej be a basis 
for M over L. For all i £ I and j £ J we have x, £ L, y 7 £ M. We shall show that 
( x iyj)ieijeJ is a basis for M over K (where xffj is the product in the subfield M). 
Since dimensions are cardinalities of bases, the theorem follows. 

First, we prove linear independence. Suppose that some finite linear combination 
of the putative basis elements is zero; that is, 

Y^kijXiyj — 0 ( kjj £ K) 

i,j 
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We can rearrange this as 

j ' i ' 

Since the coefficients kijXi lie in L and the yj are linearly independent over L, 

Yj k ijXi = 0 

i 

Repeating the argument inside L we find that kij = 0 for all i £ /, j £ J. So the ele¬ 
ments x{yj are linearly independent over K. 

Finally we show that the xffj span M over K. Any element x £ M can be written 

x = L^/fy 

j 

for suitable Xj £ L , since the y ,■ span M over L. Similarly for any j £ J 

Xj = 

i 

for X/ j £ K. Putting the pieces together, 

x =EVw 

ij 


as required. □ 

Example 6.5. Suppose we wish to find [Q(-\/2, x/3) : Q]. It is easy to see that 
{1, y/2} is a basis for Q(v^) over Q. For let a £ Q(v^). Then a = p+qV 2 where 
p. q £ Q, proving that {1, V2} spans Q(v^) over Q. It remains to show that 1 and 
y/2 are linearly independent over Q. Suppose that p + q\/2 = 0, where p, q £ Q. If 
q X 1 0 then \/2 = pjq, which is impossible since \/l is irrational. Therefore q = 0. 
But this implies p = 0. 

In much the same way we can show that {1, y/3} is a basis for Q(v / 2, t/3) over 
Q(v^)-Every element ofQ(v / 2, vT) can be written as p + q\/2 + rV3+sV6 where 
p,q, r,s £ Q. Rewriting this as 

(p + qV 2) + ( r + sV2)V3 

we see that {1, x/3} spans <Q)(y/2, \/3) over Q(v^). To prove linear independence 
we argue much as above: if 

{P + qV 2) + (r + s-y/ 2)\/3 = 0 

then either (r + sy/2) = 0, whence also (p + q\J 2) = 0, or else 
V3 = (p + qV2 )/(r + sV2) £ Q(V2) 

Therefore y/3 = a + b\/2 where a,b £ Q. Squaring, we find that abV 2 is rational. 



82 


The Degree of an Extension 


which is possible only if either a = 0 or b = 0. But then \/3 = a or \/3 = bs/l, both of 
which are absurd. Then (p +qy/2) = (r + sy/2) = 0 and we have proved that {1, \/3} 
is a basis. Hence 

[Q(V2, \/3): Q] = [Q(V2, yfi) : Q(/2)][Q(/2): Q] 

=2x2=4 


The theorem even furnishes a basis for Q(\/2, a/ 3) over Q: form all possible pairs 
of products from the two bases {1, \/2} and {1, \/3}, to get the ‘combined’ basis 

{1, \/2,V / 3, \/6}. 

By induction on n we easily parlay the Short Tower Law into a useful generali¬ 
sation: 

Corollary 6.6 (Tower Law). If Kq CKiC---CK n are subfields of C, then 


[K n : Kq] = [K„ : K n -\}[K n -\ : K n _ 2 ] K 0 ] 


□ 

In order to use the Tower Law we have to get started. The degree of a simple 
extension is fairly easy to find: 

Proposition 6.7. Let K(a) : K be a simple extension. If it is transcendental then 
[K(a) : K] = °°. If it is algebraic then [A'(a) : K} = dm, where m is the minimal 
polynomial of a over K. 

Proof. For the transcendental case it suffices to note that the elements 1, a, a 2 ,... 
are linearly independent over K. For the algebraic case, we appeal to Lemma 5.14. 

□ 

For example, we know that C = R(/) where i has minimal polynomial t 2 + 1, of 
degree 2, Hence [C : IR] = 2, which agrees with our previous remarks. 

Example 6.8. We now illustrate a technique that we shall use, without explicit refer¬ 
ence, whenever we discuss extensions of the form Q(y / aT,..., \/ttn ): Q with rational 
a.j. The technique can be used to prove a general theorem about such extensions, see 
Exercise 6.15. The question we tackle is: find [Qdv 7 / \/3, \/5): Q]. 

By the Tower Law, 

[Q(x/2,x/3,/5):Q] 

= [Q(/2, \/3, V5): Q( y/2, /3)] [Q(y/2, \/3): Q(\/2)] [Q(/2): Q] 

It is ‘obvious’ that each factor equals 2, but it takes some effort to prove it. As a 
cautionary remark: the degree y/lO, %/l5) : Q] is 4, not 8 (Exercise 6.14). 

(a) Certainly [Q(\/2) : Q] =2. 

(b) If a/ 3 ^ Q(y/) then ^(v^, a/ 3) : Q(-\/2)] = 2. So suppose y/3 G Q(-\/2), im¬ 
plying that 


V3=p + qV2 p,q&Q 
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We argue as in Example 6.5. Squaring, 

3 = (p 2 + 2(f) + 2pq\fl 


so 

p 2 + 2 q 2 = 3 pq = 0 

If p = 0 then 2 q 2 = 3, which is impossible by Exercise 1.3. If q = 0 then p 2 = 3, 
which is impossible for the same reason. Therefore \/3 ^ Q(a/ 2), and VS) : 

Q(V2)]=2. 

(c) Finally, we claim that V5 ^ Q(V2, V3). Here we need a new idea. Suppose 
V~5 = p + qV2 + rV3+s\/6 p,q,r,s£ Q 


Squaring: 

5 = p 2 + 2 q 2 + 3r 2 + 6s 2 + (2 pq + 6rs)V2 + (2 pr + 4qs ) V3 + (2 ps + 2 qr) V6 


whence 

p 2 + 2 q 2 + 3 r 2 + 6 s 2 = 5 
pq + 3rs = 0 
pr + 2qs = 0 
ps + qr = 0 


( 6 . 1 ) 


The new idea is to observe that if (p,q,r,s) satisfies (6.1), then so do (p,q,—r,—s), 
( p , - q. r. —,v), and (p, —q. —r, s). Therefore 


p + qV2 + r\/3+sV6 = Vs 
p + q\/2 — rV3 — sa/6 = ±\/5 
p — qV2-\~ rV3 — sV 6 = ±a/5 
p — q\[2 — rVS+sVb = ±\/5 

Adding the first two equations, we get p + qV 2 = 0 or p + qV 2 = a/ 5- The first 
implies that p = q = 0. The second implies that p 2 + 2q 2 + 2pq\/2 = 5, which is 
easily seen to be impossible. Adding the first and third, rV 3 = 0 or rV 3 = a/5, SO 
r = 0. Finally, s = 0 since sV 6 = \/5 is impossible by Exercise 1.3. 

Having proved the claim, we immediately deduce that 

[Q(V2, y/3, V5 ): Q(v / 2, v^)] = 2 

which implies that [Q(-\/2, V3, V5) : Q] = 8. 

Linear algebra is at its most powerful when dealing with finite-dimensional vec¬ 
tor spaces. Accordingly we shall concentrate on field extensions that give rise to such 
vector spaces. 

Definition 6.9. A finite extension is one whose degree is finite. 



84 


The Degree of an Extension 


Proposition 6.7 implies that any simple algebraic extension is finite. The converse 
is not true, but certain partial results are: see Exercise 6.16. In order to state what is 
true we need: 

Definition 6.10. An extension L : K is algebraic if every element of L is algebraic 
over K. 

Algebraic extensions need not be finite, see Exercise 6.11, but every finite exten¬ 
sion is algebraic. More generally: 

Lemma 6.11. An extension L : K is finite if and only if L = K (a. ..., a,-) where r is 
finite and each a, is algebraic over K. 

Proof Induction using Theorem 6.4 and Proposition 6.7 shows that any extension of 
the form K(o.\af) : K for algebraic a ; is finite. 

Conversely, let L : K be a finite extension. Then there is a basis {ai, ..., a s } for 
L over K, whence L = K(o .\, ..., af). Each a,- is clearly algebraic. □ 


EXERCISES 

6.1. Find the degrees of the following extensions: 


(a) 

<C:Q 


(b) 

^(v^): R 


(c) 

Q(a) : Q where 

a is the real cube root of 2 

fd) 

Q(3, y/5, Vn ): 

Q 

(e) 

Q(v / 6): Q 


(f) 

Q(a): Q where 

a 1 = 3 


6.2. Show that every element of Q(\/5, \fl) can be expressed uniquely in the form 

p + qV5 + r\fl + s\/35 

where p, q. r. s <£ Q. Calculate explicitly the inverse of such an element. 

6.3. If [L : K) is a prime number show that the only fields M such that K C M C L 
are K and L themselves. 

6.4. If [L : K] = 1 show that K = L. 

6.5. Write out in detail the inductive proof of Corollary 6.6. 

6.6. Let L.K be an extension. Show that multiplication by a fixed element of L is 
a linear transformation of L considered as a vector space over K. When is this 
linear transformation nonsingular? 
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6.7. Let L.K be a finite extension, and let p be an irreducible polynomial over K. 
Show that if dp does not divide [L : K], then p has no zeros in L. 

6.8. If L : K is algebraic and M : L is algebraic, is M : K algebraic? Note that you 
may not assume the extensions are finite. 

6.9. Prove that Q(\/3, \/5) = Q(\/3 + \/5). Try to generalise your result. 

6.10* Prove that the square roots of all prime numbers are linearly independent over 
Q. Deduce that algebraic extensions need not be finite. 

6.11 Find a basis for Q(yJ (1 + \/3)) over Q and hence find the degree of 
Q(\J (1 + x/3)) : Q. (Hint: You will need to prove that 1 + y/3 is not a square 

in Q(v / 3)-) 

6.12 If [L : K] is prime, show that L is a simple extension of K. 

6.13 Show that [Q(\/6, v'lO, /L5): Q] = 4, not 8. 

6.14* Let K be a subfield of C and let be elements of K such that any 

product cij 1 ■ ■ ■ cij k , with distinct indices j/, is not a square in K. Let a ; - = ^Jaj 
for 1 < j < n. Prove that [K(a\a„): K] = 2”. 

If K = Q, how can we verify the hypotheses on the cij by looking at their prime 
factorisations? 

6.15* Let L.K be an algebraic extension and suppose that K is an infinite field. Prove 
that L.K is simple if and only if there are only finitely many fields M such 
that K CM CL, as follows. 

(a) Assume only finitely many M exist. Use Lemma 6.11 to show that L : K 
is finite. 

(b) Assume L = K(a\, ai). For each /3 € K let Jp = K(a\ + Only 
finitely many distinct Jp can occur: hence show that L = Jp for some j3. 

(c) Use induction to prove the general case. 

fd) For the converse, let L = K(a) be simple algebraic, with K C M C L. 
Let m be the minimal polynomial of a over K , and let % be the min¬ 
imal polynomial of a over M. Show that m*f|w in L[t\. Prove that him 
determines M uniquely, and that only finitely many him can occur. 

6.16 Mark the following true or false. 

(a) Extensions of the same degree are isomorphic. 

(b) Isomorphic extensions have the same degree. 

(c) Every algebraic extension is finite. 

fd) Every transcendental extension is not finite. 
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(e) Every element of C is algebraic over R. 

(f) Every extension of R that is a subfield of C is finite. 

(g) Every algebraic extension of Q is finite. 



Chapter 7 
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Already we are in a position to see some payoff. The degree of a field extension 
is a surprisingly powerful tool. Even before we get into Galois theory proper, we 
can apply the degree to a warm-up problem—indeed, several. The problems come 
from classical Greek geometry, and we will do something much more interesting and 
difficult than solving them. We will prove that no solutions exist, subject to certain 
technical conditions on the permitted methods. 

According to Plato the only ‘perfect’ geometric figures are the straight line and 
the circle. In the most widely known parts of ancient Greek geometry, this belief 
had the effect of restricting the (conceptual) instruments available for performing 
geometric constructions to two: the ruler and the compass. The ruler, furthermore, 
was a single unmarked straight edge. 

Strictly, the term should be ‘pair of compasses’, for the same reason we call a 
single cutting instrument a pair of scissors. However, ‘compass’ is shorter, and there 
is no serious danger of confusion with the navigational instrument that tells you 
which way is north. So ‘compass’ it is. 

With these instruments alone it is possible to perform a wide range of construc¬ 
tions, as Euclid systematically set out in his Elements somewhere around 300 BC. 
This series of books opens with 23 definitions of basic objects ranging from points 
to parallels, five axioms (called ‘postulates’ in the translation by Sir Thomas Heath), 
and five ‘common notions’ about equality and inequality. The first three axioms state 
that certain constructions may be performed: 

(1) To draw a straight line from any point to any point. 

(2) To produce a finite straight line continuously in a straight line. 

(3) To describe a circle with any centre and any distance. 

The first two model the use of a ruler (or straightedge); the third models the use of a 
compass. 

Definition 7.1. A ruler-and-compass construction in the sense of Euclid is a finite 
sequence of operations of the above three types. 

Note the restriction to finite constructions. Infinite constructions can sometimes 
make theoretical sense, and are more powerful: see Exercise 7.12. They provide ar¬ 
bitrarily good approximations if we stop after a finite number of steps. 

Later Greek geometry introduced other ‘drawing instruments’, such as conic sec¬ 
tions and a curve called the quadratrix. But long-standing tradition associates Euclid 
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with geometric constructions carried out using an unmarked ruler and a compass. 
The Elements includes ruler-and-compass constructions to bisect a line or an angle, 
to divide a line into any specified number of equal parts, and to draw a regular pen¬ 
tagon. 

However, there are many geometric problems that clearly ‘should’ have solu¬ 
tions, but for which the tools of ruler and compasses are inadequate. In particular, 
there are three famous constructions which the Greeks could not perform using these 
tools: Duplicating the Cube, Trisecting the Angle, and Squaring the Circle. These 
ask respectively for a cube twice the volume of a given cube, an angle one-third the 
size of a given angle, and a square of area equal to a given circle. 

It seems likely that Euclid would have included such constructions if he knew 
any, and it is a measure of his mathematical taste that he did not present fallacious 
constructions that are approximately correct but not exact. The Greeks were inge¬ 
nious enough to find exact constructions if they existed, unless they had to be ex¬ 
traordinarily complicated. (The construction of a regular 17-gon is an example of a 
complicated construction that they missed: see Chapter 19.) We now know why they 
failed to find ruler-and-compass constructions for the three classical problems: they 
don’t exist. But the Greeks lacked the algebraic techniques needed to prove that. 

The impossibility of trisecting an arbitrary angle using ruler and compass was 
not proved until 1798 when Gauss was writing his Disquisitiones Arithmeticae, pub¬ 
lished in 1801. Discussing his construction of the regular 17-gon, he states without 
proof that such constructions do not exist for the 9-gon, 25-gon, and other numbers 
that are not a power of 2 times a product of distinct Fermat primes—those of the 
form 2 2 + 1. He also writes that he can ‘prove in all rigour that these higher-degree 
equations [involved in the construction] cannot be avoided in any way’, but adds ‘the 
limits of the present work exclude this demonstration here.’ Constructing the regular 
9-gon is clearly equivalent to trisecting so Gauss’s claim disposes of trisections. 
He did not publish a proof; the first person to do so was Pierre Wantzel in 1837. 

This result does not imply that an angle one third the size of a given one does 
not exist, or that practical constructions with very small errors cannot be devised; it 
tells us that the specified instruments are inadequate to find it exactly. Wantzel also 
proved that it is impossible to duplicate the cube with ruler and compass. Squaring 
the circle had to wait even longer for an impossibility proof. 

In this chapter we mention approximate constructions, which are entirely accept¬ 
able for practical work. We make some brief historical remarks to point out that the 
Greeks could solve the three classical problems using ‘instruments’ that went be¬ 
yond just ruler and compass. We identify the Euclidean plane R 2 with the complex 
plane C, which lets us avoid considering the two coordinates of a point separately 
and greatly simplifies the discussion. We formalise the concept of ruler-and-compass 
construction by defining the notion of a constructible point in C. We introduce a 
series of specific constructions that correspond to field operations , x,/) and 
square roots in C. We characterise constructible points in terms of the ‘Pythagorean 
closure’ Q py of Q, and deduce a simple algebraic criterion for a point to be con¬ 
structible. By applying this criterion, we prove that the three classical problems can- 
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not be solved by mler-and-compass construction. We also prove that there is no such 
construction for a regular heptagon (7-sided polygon). 


7.1 Approximate Constructions and More General Instruments 

For the technical drawing expert we emphasise that we are discussing exact con¬ 
structions. There are many approximate constructions for trisecting the angle, for 
instance, but no exact methods. Dudley (1987) is a fascinating collection of approxi¬ 
mate methods that were thought by their inventors to be exact. Figure 10 is a typical 
example. To trisect angle BOA, draw line BE parallel to OA. Mark off AC and CD 
equal to OA, draw arc DE with centre C and radius CD. Drop a perpendicular EF to 
OD and draw arc FT centre O radius OF to meet BE at T. Then angle AOT approxi¬ 
mately trisects angle BOA. See Exercise 7.10. 


B_ T E 


• 1 1 

O A C F D 

FIGURE 10: Close—but no banana. 

The Greeks were well aware that by going outside the Platonic constraints, all 
three classical problems can be solved. Archimedes and others knew that angles can 
be trisected using a marked ruler, as in Figure 11. The ruler has marked on it two 
points distance r apart. Given ZAOB = 0 draw a circle centre O with radius r, cutting 
OA at X, OB at Y. Place the ruler with its edge through X and one mark on the line 
OY at D; slide it until the other marked point lies on the circle at E. Then ZEDO 
= 0/3. For a proof, see Exercise 7.3. Exercise 7.14 shows how to duplicate the cube 
using a marked ruler. 

Setting your compasses up against the ruler so that the pivot point and the pen¬ 
cil effectively constitute such marks also provides a trisection, but again this goes 
beyond the precise concept of a ‘ruler-and-compass construction’. Many other uses 
of ‘exotic’ instruments are catalogued in Dudley (1987), which examines the history 
of trisection attempts. Euclid may have limited himself to an unmarked ruler (plus 
compasses) because it made his axiomatic treatment more convincing. It is not en¬ 
tirely clear what conditions should apply to a marked ruler—the distance between 
the marks causes difficulties. Presumably it ought to be constructible, for example. 

The Greeks solved all three problems using conic sections, or more recondite 
curves such as the conchoid of Nichomedes or the quadratrix (Klein 1962, Coolidge 



90 


Ruler-and-Compass Constructions 


B 


A 



FIGURE 11: Trisecting an angle with a marked ruler. 


1963). Archimedes tackled the problem of Squaring the Circle in a characteristically 
ingenious manner, and proved a result which would now be written 

3$ <tt<3\ 

This was a remarkable achievement with the limited techniques available, and refine¬ 
ments of his method can approximate n to any required degree of precision. 

Such extensions of the apparatus solve the practical problem, but it is the theo¬ 
retical one that holds the most interest. What, precisely, are the limitations on ruler- 
and-compass constructions? With the machinery now at our disposal it is relatively 
simple to characterise these limitations, and thereby give a complete answer to all 
three problems. We use coordinate geometry to express problems in algebraic terms, 
and apply the theory of field extensions to the algebraic questions that arise. 


7.2 Constructions in C 

We begin by formalising the notion of a ruler-and-compass construction. Assume 
that initially we are given two distinct points in the plane. Equivalently, by Euclid’s 
Axiom 1, we can begin with the line segment that joins them. These points let us 
choose an origin and set a scale. So we can identify the Euclidean plane R 2 with C, 
and assume that these two points are 0 and 1. 

Euclid dealt with finite line segments (his condition (1) above) but could make 
them as long as he pleased by extending the line (condition (2)). We find it more 
convenient to work with infinitely long lines (modelling an infinitely long ruler), 
which in effect combines Euclid’s conditions into just one: the possibility of drawing 
the (infinitely long) line that passes through two given points. From now on, ‘line’ is 
always used in this sense. 
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If zi ,Z 2 G C and 0 < r £ M, define 

L(zi,zi) = the line joining z\ to zi (zi ^ zi) 

C(zi, r) = the circle centre z t with radius r > 0 

We now define constructible points, lines, and circles recursively: 

Definition 7.2. For each n £ N define sets . fi£ n , and ( C n of n-constructible points, 

lines , and circles, by: 

= {0,1} 

^o = 0 
%?0 = ® 

^1+1 = {L(zi,Z 2) -Zl,Z2 G 

&n +1 = |C(Z 1 ,|Z2-Z3|) : Zl,Z2,Z3 G 

= {z G C : z lies on two distinct lines in J£ n +\} U 

{z G C : z lies on a line in ££ n +\ and a circle in c tf n+ 1 } U 
{z G C : z lies on two distinct circles in J ? n +\} 


\+i43 



Figure 12 shows that 


^ = {- 1 , 0 , 1 , 2 , 


1 ±iVl 
2 


} 


Lemma 7.3. For all n £ N, 


C &> n+l se n c 22 ?„ +1 % c tf B+1 


and each is a finite set. 
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Proof. The inclusions are clear. Let p„ be the number of points in 3^ n , l n the number 
of lines in <i?„, and c„ the number of circles in 'C n . Then 

\Xi+l\ < \Pn{Pn + 1) 

\^n+l\ < Pn\Pn{Pn + l) 

\^n+l\ — \ln+\{Jn+\ T 1 ) T 2.l n C n T (c n +l T 1 ) 

bearing in mind that a line or circle meets a distinct circle in < 2 points. By induction, 
all three sets are finite for all n. □ 

We formalise a Euclidean ruler-and-compass construction using these sets. The 
intuitive idea is that starting from 0 and 1, such a construction generates a finite 
sequence of points by drawing a line through two previously constructed points, or 
a circle whose centre is one previously constructed point and whose radius is the 
distance between two previously constructed points, and then defining a new point 
using intersections of these. 

Definition 7.4. A point z £ C is constructible if there is a finite sequence of points 

ZO = 0,Z 1 = l,Z2)Z3,---Zt = Z (7.1) 

such that Zj+i lies in at least one of: 


L {z h ,z j2 )CL(z h ,z k ) 

L (zj l ,Zj2)nC(z h ,\zj 4 -Zj 5 \) 

c (z;i. \zj 2 - *h I ) nc (z/4>|z/5 - Zj 6 1) 


where all /,■ < j and the intersecting lines and circles are distinct. 

In the first case, the lines must not be parallel in order to have non-empty in¬ 
tersection; in the other cases, the line must meet the circle and the two circles must 
meet. These technical conditions can be expressed as algebraic properties of the z,j. 

We can now prove: 

Theorem 7.5. A point z& C is constructible if and only if z G 2P„for some n £ N. 

Proof Let z £ C be constructible, using the sequence (7.1). Inductively, it is clear 
that z = Zk£ £?k- 

Conversely, let z £ 2?k- Then we can find a sequence z,j £ £?p where 0 < j < k, 
satisfying (7.1). □ 

To characterise constructible points, we need: 

Definition 7.6. The Pythagorean closure Q py of Q is the smallest subfield K C C 
with the property: 

z£K => € K (7.2) 

The Pythagorean closure of Q exists because every subfield of C contains Q, so Q py 
is the intersection of all subfields of C satisfying (7.2). 
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The main theorem of this section is: 

Theorem 7.7. A point zS C is constructible if and only if z £ Q py . Equivalently, 

oo 

U &n = Q Py (7.3) 

72=0 


Pre-proof Discussion. 

We can summarise the main idea succinctly. Coordinate geometry in C shows that 
each step in a ruler-and-compass construction leads to points that can be expressed 
using rational functions of the previously constructed points together with the square 
root of a rational function of those points. Conversely, all rational functions of given 
points can be constructed, and so can square roots of given points. Therefore anything 
that can be constructed lies in Q py , and anything in Q py can be constructed. 

The details require some algebraic computations in C and some basic Euclidean 
geometry. We prove Theorem 7.7 in two stages. In this section we show that 

(A) &>„ C QPy for all n £ N. 

In the next section, after describing some basic constructions for arithmetical opera¬ 
tions and square roots, we complete the proof by establishing 

(B) If z £ Q py then z £ for some n £ N. 

Equation (7.3) is an immediate consequence of (A) and (B). 

Proof of Part (A). Part (A) follows by coordinate geometry in C = R 2 . The details 
are tedious, but we give them for completeness. Use induction on n. Since .y’o = 
{0,1} C Q, we have £ z. Suppose inductively that 2? n C Q py , and let z £ &n +1 • 
We have to prove that z £ Q py . 

There are three cases: line meets line, line meets circle, circle meets circle. 

Case 1 : Line meets line. Here {z} = L(zi,Zi) HL(z 3 ,Z 4 ) where the Zj £ £? n C Q py 
(induction hypothesis) and the lines are distinct. Therefore there exist real a,f> such 
that 


z = ttzi + (1 — a)z 2 
z = ) 3 z 3 + (1 ~P)Z4 


Therefore 


a = 


P(Z3-Z4)+Z4~Z2 


Z 1 ~Z2 


Since af 3 £ R, we also have 


a = 


P(Z3~Z4)+Z4-Z2 


Z 1 ~Z2 


where the bar is complex conjugate. These two equations have a unique solution for 
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а, j3 because we are assuming that the lines meet at a unique point z, and the solution 
is: 

a _ Z 2 (Z4 - Z3 ) + Z 2 (Z3 - Z4) - Z 3 Z 4 + Z 4 Z 3 
(Zl Z 2 ) (Z3 -Z4) + (Z4-Z3)(Z1 ~Z 2 ) 

P _ Z3 (zi Z 2 ) + Z3(Z2 Zl) Z 2 Z 1 T Z 1 Z 2 
(Z4 - Z3) (Z2 - Zl) + (Zl - Z2) (Z4 “ Z3 ) 

so a,p G Q py . Then z = azi + (1 - «)z 2 € Q py . 

Case 2: Line meets circle. Here z G C(zi,Z 2 ) DC(z 3 , |z 4 — Zs|) where the G 
C Q py (induction hypothesis). Let r = |z 4 — zs|. There exist a,0£R such that 

z = azi + (1 — CCZ 2 ) 

Z = Z 3 + re ,e 

Therefore 

«(zi-Z2)+Z2 = Z 3 + re' e 
«(zi-Z2)+Z2 = Z 3 +re~' 0 

where we take the complex conjugate to get the second equation. We can eliminate 
6 to get 

(«(zi —Z2)+Z2-Z3)(«(zi -Z2)+Z2-Z3) = re ,0 .re~' 0 =r = (z 4 -Z 5 )(z4 -Zs) 

which is a quadratic equation for a with coefficients in Q py . Since the quadratic 
formula involves only rational functions of the coefficients and a square root, a € 
Q py . Therefore z G Q py - 

Case 3: Circle meets circle. Here z G C(zi, \zi — Z 3 1) nC(z 4 , |zs — ze|) where the 
Zj G C Q py (induction hypothesis). Let r = |z 2 — Z 3 |,s = |zs — Z(,\. There exist 

б , 0 G K. such that 

Z = Zl +re' e 
Z = Z4 + se'^ 

Take conjugates and eliminate 9,(j) as above to get 

(z Zl) (z Zi) = r 2 
(z Z 4 ) (z Z 4 ) = s 2 

Solving for z and z (left as an exercise) we find that z satisifies a quadratic equation 
with coefficients in Q py . Therefore z G Q py . □ 


7.3 Specific Constructions 

To prove the converse (B) above we first discuss constructions that implement 
algebraic operations and square roots in C. The next lemma begins the process of 
assembling useful constructions and bounding the number of steps they require. 



Specific Constructions 


95 


Lemma 7.8. (1) A line can be bisected using a 2-step construction. 

(2) An angle can be bisected using a 2-step construction. 

(3) An angle can be copied (so that its vertex is a given point and one leg lies 
along a given line through that point) using a 3-step construction. 

(4) A perpendicular to a given line at a given point can be constructed using a 
2-step construction. 

Proof. See Figure 13 for diagrams. 

(1) Let the line be L[z,w\. 

Draw circles C[z, \z — w|] and C[w , \z — vv|]. These meet at two points u,v. 

The midpoint p of L\z. w] is its intersection with L[u,v\. 

(2) Let 8 be the angle between L[a,b\ and L[a,c\. 

Draw C[a , 1] meeting L[a,b] at p and L[a,c\ at q. 

Draw C\p, 1] and C[q, 1] meeting at s,t. Then L[a,s] (or L[af}) bisects 8. 

(3) Let 8 be the angle between L[a,b] and L[a,c\. 

Suppose p,q £ C are given, and we wish to construct angle 8 at p with one side 
L[p,q}. 

Let C[a, 1] meet L[a,b] at d and L[a,c\ at e. 

Let L[p, 1] meet L[p,q\ at s. 

Let C[s, | d — e\\ meet C[p, 1] at t as shown. Then the angle between L[p,t\ and 
L[p,q] is 8 for the appropriate choice of t. 

(4) Let a lie on a line L. Let the circle C[a. 1 ] meet L at b. c. 

Let C[b,\b — c|] meet C[c, |Z? — c|] at p, q. 

Then L[p. q] is the required perpendicular. □ 

The next lemma continues the process of collecting useful constructions. 

Lemma 7.9. (1) A parallel to a given line through a given point not on that line 

can be constructed using a 3-step construction. 

(2) A triangle similar to a given triangle, with one edge prescribed, can be con¬ 
structed using a 7-step construction. 

Proof. See Figure 14 for diagrams. 

(1) Let the line be L[a,b\ and let p £ C be a point that does not lie on the line. Using 
Lemma 7.8(3), copy the angle between L[a,b] and L[a,p\ to vertex p , with one leg 
lying along L[a,p] produced. The other leg is then parallel to L[a,b\. 

(2) Let the vertices of the first triangle be a,b,c. Suppose two vertices p,q of the 
required similar triangle are given, such that the similarity maps atop and b to q. 

Using Lemma 7.8(3), copy angles 0,0 at a.b to locations p.q, with one leg of 
each lying along L[p,q], Then the other legs meet at s, which is the third vertex of 
the similar triangle required. □ 

We can now prove the existence of constructions that produce useful algebra 
results: 
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FIGURE 13: Four basic constructions. Top left: Bisecting aline. Top right: Bisecting 
an angle. Bottom left: Copying an angle. Bottom right: Constructing a perpendicular. 



FIGURE 14: Left: Constructing a parallel. Right: Constructing a similar triangle. 


Theorem 7.10. Let z, w £ C. Then, assuming z and w are already constructed: 

( 1 ) Z + w can be constructed using a 7-step construction. 

(2) — z can be constructed using a 1 -step construction. 

(3) zw can be constructed using a 7-step construction. 

(4) 1 jz can be constructed using an 8-step construction. 

(5) ±^/z can be constructed using an 8-step construction. 

Proof. See Figure 15 for diagrams. 

(1) If z,w are not collinear with 0, complete the parallelogram with vertices 0,z, w. 
The remaining vertex is z + w. 
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If z,w are collinear with 0, circle C[z, |w|] meets L[0,z] in two points, z + w and 
z — w. 

(2) The circle C[0, |z|] meets the line L[0,z] at z and at — z. 

(3) Consider the triangle T with vertices 0, l,z. Construct point p so that the triangle 
with vertices 0 ,w,p is similar to T. 

We claim that p = zw. By similarity |p|/|w| = |z|/l, so \p\ = |z||w|. Further, 
arg(p) = argz + arg vv, where arg denotes the argument. Therefore p = zw. 

(4) Let C[0,1] meet L[0,z] at p (with 0 lying between z and p). Then \p\ = 1. 
Construct a triangle with vertices 0 ,p,q similar to 0 ,z, 1. Then \q\/\ = |p|/|z| = 

VN.so \q = \/\z\. 

Let C[0,g] meet L[p,z] at s, on the same side of the origin as p. Then |s| = 1 /\z\ 
and arg(s) = n + arg(z), so p = 1/z. 

(5) Let z = e' 0 . Then ^fz = ! 2 ,c l(7l+e I 1 ' 1 . So we have to bisect 6 and construct 

v^e R+. 

Use C[0,1] to construct —1. 

Bisect L[— 1, r] to get a = (r — l)/2. 

Construct the perpendicular P to L[0,1] at 0. 

Let circle C[a, \r — a\\ meet P at s. Then the intersecting chords theorem (or a 
short calculation with coordinates) implies that s.s = l.r, so s = \fr. 

Construct line L through 0 bisecting the angle between L[0,r] and L[0,z]. 

This meets the circle C[0, |s|] at ±^/z. For the other square root use (2) above. □ 

Next we characterise the elements of Q py in terms of field extensions. 

Theorem 7.11. A complex number a is an element o/Q py if and only if there is a 
tower of field extensions 


Q = K 0 CK 1 C...CK n =Q(a) 


such that 

[Kj +1 :Kj\= 2 

for 0 < j < n— 1 . 

Proof. First, suppose such a tower exists. We prove by induction on j that Kj C Q py . 
This is clear for j = 0. Now, Kj \ is an extension of Kj of degree 2, so Kj + \ = K f fj) 
where the minimum polynomial of p over Kj is quadratic. Since quadratics can be 
solved by extracting square roots, p € Q py , so Kj + \ C Q py . Therefore a G Q py . 

Next, suppose that a G Q py . We prove that such a tower exists. By the definition 
of Q py there is a tower 


Q = L 0 CL, C...CL„DQ(a) 
such that [Lj+i : Lj] = 2 for 0 < j < n — 1. Define 

Mj =L,-nQ(a) 

Consider the Lj and Mj as vector spaces over Q, and note that they are finite¬ 
dimensional. We have dimL /+ i = 2dimL ; for all relevant j. Therefore either A7 ;+ 1 = 
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Mj or dim/W /+ i = 2dim Mj. Delete Mj+\ if it equals Mj and renumber the resulting 
Mj as Kq,K\ ,..., K n , with Kq = Q. Clearly K„ = Q(a). □ 

From this we immediately deduce a simple necessary condition for a point to be 
constructible: 

Theorem 7.12. If a is constructible then [Q(a) : Q] is a power of 2. □ 

Now we are ready for the: 

Proof Proof of Part (B) To complete the proof, we must prove (B). If z £ Q py 
then there is a finite sequence of points zo = 0,zi = 1 ,...Zk = z such that zi+\ £ 
Q(zo,...,z/,a) where a 2 € Q(zo,■ • •,Z/). Inductively, zi is constructible by Theo¬ 
rem 7.10, so zi+ 1 is constructible. □ 



FIGURE 15: Constructions for five operations. Top left: z+w. Top right: — z. Middle: 
zw. Bottom left: 1 /z. Bottom right: ±y/z. 
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7.4 Impossibility Proofs 

We now apply the above theory to prove that there do not exist ruler-and-compass 
constructions that solve the three classical problems mentioned in the introduction 
to this chapter. 

We first prove the impossibility of Duplicating the Cube, where the method is 
especially straightforward. 

Theorem 7.13. The cube cannot be duplicated by ruler and compass construction. 

Proof. Duplicating the cube is equivalent to constructing a = \/2. Suppose for a 
contradiction that a £ Q py , and let m be its minimum polynomial over Q. By Theo¬ 
rem 7.12, dm = 2 k for some k. 

However, since a 3 = 2, the minimum polynomial of a divides x 3 — 2. But this 
is irreducible over Q. If not, it would have a linear factor x — a with a G Q, and then 
a 3 = 2, so a = a. But a is irrational.Therefore dm = 3, which is not a power of 2, 
contradicting Theorem 7.12. □ 

Some angles can be trisected, for example 7 t/ 2. However, the required construc¬ 
tion should work for any angle, so to prove impossibility it is enough to exhibit one 
specific angle that cannot be trisected. We prove: 

Theorem 7.14. There exists an angle that cannot be trisected by ruler-and-compass 
construction. 

Proof. We prove something more specific: the angle ?? cannot be trisected. We know 
that co = e 27n / 3 £ Q py , since co = Suppose for a contradiction that such a 

construction exists. Then £ = er K '/ 9 g QPy. Therefore a = £ + £ _1 € Q py , so its 
minimum polynomial m over Q has degree dm = 2 k for some k. Now £ 3 = ft) and 
ft) 2 + co + 1 = 0, so £ 6 + £ 3 + 1 = 0. Therefore £ 6 + tf 3 = — 1. But 

a 3 = (C + r 1 ) 3 

= c 3 +3c+ 3 r‘+r 3 
= C 3 + 3C+3C“' + C 6 

= 3a — 1 

Therefore m divides x 3 3x + 1. But this is irreducible over Q by Gauss’s lemma, so 
m = x 3 3x + 1 and dm = 3, contradicting Theorem 7.12. □ 

This is the place for a word of warning to would-be trisectors, who are often 
aware of Wantzel’s impossibility proof but somehow imagine that they can succeed 
despite it (Dudley 1987). If you claim a trisection of a general angle using ruler and 
compasses according to our standing conventions (such as ‘unmarked ruler’) then 
you are in particular claiming a trisection of 7r/3 using those instruments. The above 
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proof shows that you are therefore claiming that 3 is a power of 2; in particular, since 
3 / 1, you are claiming that 3 is an even number. 

Do you really want to go down in history as believing you have proved this? 

The final problem of antiquity is more difficult: 

Theorem 7.15. The circle cannot be squared using ruler-and-compass construc¬ 
tions. 

Proof. Such a construction is equivalent to constructing the point (0, \fn) from the 
initial set of points Po = {(0,0), (1,0)}. From this we can easily construct (0, n). So 
if such a construction exists, then [Q(?r) : Q] is a power of 2, and in particular n 
is algebraic over Q. On the other hand, a famous theorem of Ferdinand Lindemann 
asserts that K is not algebraic over Q. The theorem follows. □ 

We prove Lindemann’s theorem in Chapter 24. We could give the proof now, but 
it involves ideas off the main track of the book, and has therefore been placed in the 
Chapter 24. If you are willing to take the result on trust, you can skip the proof. 

As a bonus, and to set the scene for Chapter 19 on regular polygons, we dispose 
of another construction that the ancients might well have wondered about. They knew 
constructions for regular polygons with 3, 4, 5, sides, and it is easy to double these to 
get 6, 8, 10, 12, 16, 20, and so on. The impossibility of trisecting 2n/3 also proves 
that a regular 9-gon (enneagon) cannot be constructed with ruler and compass. But 
the first ‘missing’ case is the regular 7-gon (heptagon). Our methods easily prove this 
impossible, too: 

Theorem 7.16. The regular 7-gon (heptagon) cannot be constructed with ruler and 
compass. 

Proof. Constructing the regular heptagon is equivalent to proving that 

£ = e 2 * 1 '/ 7 g QPy 

and this complex 7th root of unity satisfies the polynomial equation 

C 6 + C 5 + C 4 + C 3 + C 2 + C + 1 = 0 

because £ 7 — 1 = 0 and the polynomial t 1 — 1 factorises as 

t 1 - 1 = (t- l)(t 6 +t 5 +t 4 + t 3 +t 2 +t + 1) 

Since 7 is prime. Lemma 3.22, implies that t 6 + t 5 + f 4 + f 3 +t 2 +t + 1 is irreducible. 
Its degree is 6, which is not a power of 2, so the regular 7-gon is not constructible. 

There is an alternative approach in this case, which does not appeal to Eisen- 
stein’s Criterion. Rewrite the above equation as 

c 3 +c 2 +c+i+r i +r 2 +r 3 =o 

Now £ € Q py if and only if a = £ + e Q py , as above. Observe that 

a 3 = C 3 + 3C + 3C" 1 + r 

a 2 = C 2 + 2 + C“ 2 


3 
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so 

a 3 + a 2 -3a- 1 = 0 


The polynomial x 3 • x 2 — 3x— 1 is irreducible by Gauss’s Lemma, Lemma 3.17, so 
the degree of the minimum polynomial of a over Q is 3. Therefore a f Q py . □ 


7.5 Construction From a Given Set of Points 

There is a ‘relative’ version of the theory of this chapter, in which we start not 
with {0,1} but some finite subset PC C, satisfying some simple technical condi¬ 
tions. This set-up is more appropriate for discussing constructions such as ‘given an 
angle, bisect it’, without assuming that the original angle is itself constructible. In 
this context. Definition 7.4 is modified to: 

Definition 7.17. Let P be a finite subset of C containing at least two distinct el¬ 
ements, with 0,1 6 P (to identify the plane with C). For each n £ N define sets 
2P n ,F£ n , and ( C n of points, lines, and circles that are n-constructible from P by: 

=P 
&o = 0 
%?o = 0 

■&n +1 = {L(z\,zi) :zi,Z2 G &n} 

e £n+ 1 = {C(zt,|z2-Z3|) : Zl,Z2,Z3 G P?„} 

= {z G C : z lies on two distinct lines in } U 

{z G C : z lies on a line in J£ n+ \ and a circle in U 

{z G C : z lies on two distinct circles in J ? n +\} 

A point is constructible from P if it is n-constructible from P for some n. 

The entire theory then goes through, with essentially the same proofs, except that 
the ground field Q must be replaced by Q(P) throughout. The constructible points 
are precisely those in Q(P) py , defined in the obvious way, and they are characterised 
by the existence of a tower of sub fields of C starting from Q(P) such that each 
successive extension has degree 2. More precisely. Theorem 7.11 becomes 

Theorem 7.18. A complex number a is an element o/Q(P) py if and only if there is 
a tower of field extensions 

Q(P) =K 0 CKi c...CK n =Q{a) 

such that 


for 0 < j < n — 1. 

The proof is the same. 


[K j+1 :Kj]=2 
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EXERCISES 

7.1 Express in the language of this chapter methods of constructing, by ruler and 
compasses: 

(a) The perpendicular bisector of a line. 

(b) The points trisecting a line. 

(c) Division of a line into n equal parts, 
fd) The tangent to a circle at a given point. 

(e) Common tangents to two circles. 

7.2 Estimate the degrees of the field extensions corresponding to the constructions 
in Exercise 7.1, by giving reasonably good upper bounds. 

7.3 Prove using Euclidean geometry that the ‘marked ruler’ construction of Fig¬ 
ure 11 does indeed trisect the given angle AOB. 

7.4 Can the angle 2n/5 be trisected using ruler and compasses? 

7.5 Show that it is impossible to construct a regular 9-gon using ruler and com¬ 
passes. 

7.6 By considering a formula for cos 50 find a construction for the regular pen¬ 
tagon. 

7.7 Prove that the angle 0 can be trisected by ruler and compasses if and only if 
the polynomial 

4 f 3 — 3f — cos0 

is reducible over Q(cos 0). 

7.8 Verify the following approximate construction for n due to Ramanujan (1962, 
p. 35), see Figure 16. Let AB be the diameter of a circle centre O. Bisect AO 
at M, trisect OB at T. Draw TP perpendicular to AB meeting the circle at P. 
Draw BQ = PT, and join AQ. Draw OS, TR parallel to BQ. Draw AD = AS, 
and AC = RS tangential to the circle at A. Join BC, BD, CD. Make BE = BM. 
Draw EX parallel to CD. Then the square on BX has approximately the same 
area as the circle. 

(You will need to know that n is approximately . This approximation is first 
found in the works of the Chinese astronomer Zu Chongzhi in about AD 450.) 

7.9 Prove that the construction in Figure 10 is correct if and only if the identity 

0 sin 0 
sin — = --- 


holds. Disprove the identity and estimate the error in the construction. 
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FIGURE 16: Srinivasa Ramanujan’s approximate squaring of the circle. 


7.10 Show that the ‘compasses’ operation can be replaced by ‘draw a circle centre 
P 0 and passing through some point other than /V without altering the set of 
constructible points. 

7.11 Find a construction with infinitely many steps that trisects any given angle 
0, in the sense that the angle <j>„ obtained by stopping the construction after n 
steps converges to (j) = 6/3 when n tends to infinity. {Hint: consider the infinite 
series 

1 1 1 

7 + XX + 77 H- 

4 16 64 

which converges to j.) 

7.12 A race of alien creatures living in n-dimensional hyperspace W wishes to du¬ 
plicate the hypercube by ruler-and-compass construction. For which n can they 
succeed? 

7.13 Figure 17 shows a regular hexagon of side AB = 1 and some related lines. If 
XY = 1, show that YB = \/2. Deduce that the cube can be duplicated using a 
marked ruler. 

7.14 Since the angles f i f + x > f + T are distinct, but equal 6 when multiplied 
by 3, it can be argued that every angle has three distinct trisections. Show that 
Archimedes’s construction with a marked ruler (Figure 11) can find them all. 

7.15 Prove that the regular 11-gon cannot be constructed with ruler and compass. 
[Hint: Let £ = e 271 '/ 11 and mimic the proof for a heptagon.] 

7.16 Prove that the regular 13-gon cannot be constructed with ruler and compass. 
[Hint: Let £ = e 271 '/ 13 and mimic the proof for a heptagon.] 

7.17 The regular 15-gon and 16-gon can be constructed with ruler and compass. So 
the next regular polygon to consider is the 17-gon. 
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FIGURE 17: Duplicating the cube using a marked ruler. 


Why does the method used in the previous questions fail for the 17-gon? 

7.18* Prove that an angle (which you must specify and which must itself be con- 
structible) cannot be divided into five equal pieces with ruler and compass. 
[Hint: Do not start with 2 jt/ 3 or n/2, both of which can be divided into five 
equal pieces with ruler and compass (why?).] 

7.19 If a g Q, prove that the angle 0 such that tan 0 = a is constructible. 

7.20* Let 6 be such that tan 0 = a/b where a.h g Z are coprime and b ^ 0. Prove 
the following: 

(a) If a + b is odd, then 6 can be trisected using ruler and compass if and only 
if a 2 + b 2 is a perfect cube. 

(b) If a + /; is even, then 0 can be trisected using ruler and compass if and only 
if (a 2 + b 2 )/2 is a perfect cube. 

(c) The angles tan -1 2/11 and tan -1 9/13 can be trisected using ruler and com¬ 
pass. 

[Hint: Use the fact that the ring of Gaussian integers Z[i\ = {p + iq : p,q g 
Z} has the property of unique prime factorisation, together with the standard 
formula for tan30 in terms of tan0.] 

This Exercise is based on Chang and Gordon (2014). 

7.21 Mark the following true or false. 

(a) There exist mler-and-compass constructions trisecting the angle to an 
arbitrary degree of approximation. 

(b) Such constructions are sufficient for practical purposes but insufficient 
for mathematical ones. 

(c) A point is constructible if it lies in a subfield of C whose degree over Q 
is a power of 2. 

(d) The angle n cannot be trisected using ruler and compass. 

(e) A line of length n cannot be constructed using ruler and compass. 
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(f) It is impossible to triplicate the cube (that is, construct one with three 
times the volume of a given cube) by ruler and compass. 

(g) The real number n is transcendental over Q. 

(h) The real number n is transcendental over R. 

(i) If a cannot be constructed by ruler and compass, then a is transcendental 
over Q. 
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Chapter 8 

The Idea Behind Galois Theory 


Having satisfied ourselves that field extensions are good for something, we can focus 
on the main theme of this book: the elusive quintic, and Galois’s deep insights into 
the solubility of equations by radicals. We start by outlining the main theorem that 
we wish to prove, and the steps required to prove it. We also explain where it came 
from. 

We have already associated a vector space to each field extension. For some prob¬ 
lems this is too coarse an instrument; it measures the size of the extension, but not its 
shape, so to speak. Galois went deeper into the structure. To any polynomial p £ C[f], 
he associated a group of permutations, now called the Galois group of p in his hon¬ 
our. Complicated questions about the polynomial can sometimes be reduced to much 
simpler questions about the group—especially when it comes to solution by radi¬ 
cals. What makes his work so astonishing is that in his day the group concept existed 
only in rudimentary form. Others had investigated ideas that we now interpret as 
early examples of groups, but Galois was arguably the first to recogne the concept in 
sufficient generality, and to understand its importance. 

We introduce the main ideas in a very simple context—a quartic polynomial 
equation whose roots are obvious. We show that the reason for the roots being ob¬ 
vious can be stated in terms of the symmetries of the polynomial—in an appropriate 
sense—and that any polynomial equation with those symmetries will also have ‘ob¬ 
vious’ roots. 

With a little extra effort, we then subvert the entire reason for the existence of this 
book, by proving that the ‘general’ polynomial equation of the nth degree cannot be 
solved by radicals—of a particular, special kind—when n > 5. This is a spectacular 
application of the Galois group, but in a very limited context: it corresponds roughly 
to what Ruffini proved for came close to proving) in 1813. By stealing one further 
idea from Abel, we can even remove Ruffini’s assumption, and prove that there is 
no general radical expression in the coefficients of a quintic, or any polynomial of 
degree > 5, that determines a zero. 

We could stop there. But Galois went much further: his methods are not only 
more elegant, they give much stronger results. The material in this chapter provides 
a sprinbgboard, from which we can launch into the full beauty of the theory. 


107 



108 


The Idea Behind Galois Theory 


8.1 A First Look at Galois Theory 

Galois theory is a fascinating mixture of classical and modern mathematics, and 
it takes a certain amount of effort to get used to its thought patterns. This section is 
intended to give a quick survey of the basic principles of the subject, and explain how 
the abstract treatment has developed from Galois’s original ideas. 

The aim of Galois theory is to study the solutions of polynomial equations 

f (f) = t n + a n —\t n 1 4-bao = 0 

and, in particular, to distinguish those that can be solved by a ‘formula’ from those 
that cannot. By a formula we mean a radical expression: anything that can be built 
up from the coefficients aj by the operations of addition, subtraction, multiplication, 
and division, and also—the essential ingredient—by nth roots, n = 2,3,4,_ 

In Chapter 1 we saw that polynomial equations over C of degree 1, 2, ,3 or 4 can 
be solved by radicals. The central objective of this book is a proof that the quintic 
equation is different. It cannot, in general, be solved by radicals. Along the way we 
come to appreciate the deep, general reason why quadratics, cubics, and quartics can 
be solved using radicals. 

In modern terms, Galois’s main idea is to look at the symmetries of the polyno¬ 
mial /(f). These form a group, its Galois group, and the solution of the polynomial 
equation is reflected in various properties of the Galois group. 


8.2 Galois Groups According to Galois 

Galois had to invent the concept of a group, quite aside from sorting out how 
it relates to the solution of equations. Not surprisingly, his approach was relatively 
concrete by today’s standards, but by those of his time it was highly abstract. Indeed 
Galois is one of the founders of modern abstract algebra. So to understand the modern 
approach, it helps to take a look at something rather closer to what Galois had in 
mind. 

As an example, consider the polynomial equation 
/( f ) = f 4 -4f 2 -5 =0 


which we encountered in Chapter 4. As we saw, this factorises as 

(f 2 + l)(f 2 — 5) = 0 


so there are four roots f = i,—i,s/5,—y/5. These form two natural pairs: i and — i 
go together, and so do y/5 and — \/5. Indeed, itis impossible to distinguish i from 
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— i, or \/5 from - y/5, by algebraic means, in the following sense. Write down any 
polynomial equation, with rational coefficients, that is satisfied by some selection 
from the four roots. If we let 


a = i P = -i y=V 5 8 =- Vs 

then such equations include 

cr + l=0 a + p=0 5 2 - 5 = 0 7+5 = 0 ay-p8 = 0 

and so on. There are infinitely many valid equations of this kind. On the other hand, 
infinitely many other algebraic equations, such as a + y = 0 , are manifestly false. 

Experiment suggests that if we take any valid equation connecting a, j3, 7 , and 
5, and interchange a and /3, we again get a valid equation. The same is true if we 
interchange 7 and 8. For example, the above equations lead by this process to 

j3 2 + l=0 p + a = 0 72-5 = 0 5 + 7 = 0 

py-a8 = 0 a8~py=0 p8-ay= 0 


and all of these are valid. In contrast, if we interchange a and 7 , we obtain equations 
such as 

72 + 1=0 7+/3=0 a + 5 = 0 

which are false. Exercise 8.1 outlines a simple proof that these operations preserve 
all valid equations connecting a, j3, 7 , and 5. 

The operations that we are using here are permutations of the zeros a, /3, 7 , 5. In 
fact, in the usual permutation notation, the interchange of a and ft is 


and that of 7 and 5 is 


R = 


S = 


a 

P 

7 

8 

P 

a 

7 

8 

a 

P 

7 

8 

a 

P 

8 

7 


(8.1) 


( 8 . 2 ) 


These are elements of the symmetric group S4 on four symbols, which includes all 
24 possible permutations of a, /), 7 , 5. 

If these two permutations turn valid equations into valid equations, then so must 
the permutation obtained by performing them both in turn, which is 


f a p y 8\ 

\P a 8 7 ) 


Are there any other permutations that preserve all the valid equations? Yes, of course, 
the identity 

,_( a p 7 5\ 

U P r 8 J 

It can be checked that only these four permutations preserve valid equations: the 
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other 20 all turn some valid equation into a false one. For example, if a, 8 are fixed 
and p, y are swapped, the value equation a + p 0 becomes the invalid equation 
a+ 7=0. 

It is a general fact, and an easy one to prove, that the invertible transformations 
of a mathematical object that preserve some feature of its structure always form a 
group. We call this the symmetry group of the object. This terminology is especially 
common when the object is a geometrical figure and the transformations are rigid 
motions, but the same idea applies more widely. And indeed these four permutations 
do form a group, which we denote by G. 

What Galois realised is that the structure of this group to some extent controls 
how we should set about solving the equation. 

He did not use today’s notation for permutations, and this led to potential con¬ 
fusion. To him, a permutation of, say, {1,2,3,4}, was an ordered list, such as 2413. 
Given a second list, say 3214, he then considered the substitution that changes 2413 
to 3214; that is, the map 2 i—s- 3,4 i—>- 2,1 i—>• 1,3 H >4. Nowadays we would write this 
as 

(2413 
\ 3 2 1 4 

or, reordering the top row, 

1 2 3 4\ 

1342 ; 

but Galois did not even have the i—» notation or associated concepts, so he had to 
write the substitution as 1342. His use of similar notation for both permutations and 
substitutions takes some getting used to, and probably did not make life easier for 
the people asked to referee his papers. Today’s definition of ‘function’ or ‘map’ dates 
from about 1950; it certainly helps to clarify the ideas. 

To see why permutations/substitutions of the roots matter, consider the subgroup 
H = {I,R} of G. Certain expressions in a, j3, 7 , 8 are fixed by the permutations in this 
group. For example, if we apply R to a 2 + p 2 — 5yS 2 , then we obtain p 2 + a 2 - 5yd 2 , 
which is clearly the same. In fact an expression is fixed by R if and only if it is 
symmetric in a and P . 

It is not hard to show that any polynomial in a, p, 7 , 8 that is symmetric in a and 
P can be rewritten as a polynomial in a + p, a /3, 7 , and 8. For example, the above 
expression can be written as (a + P) 2 — 2aP —5 y8 2 . But we know that <X = i,P = i, 
so that a + p = 0 and ap = 1. Hence the expression reduces to —2 — 5 7<5 2 . Now a 
and p have been eliminated altogether. 


8.3 How to Use the Galois Group 

Pretend for a moment that we don’t know the explicit zeros i, —i, \/5, — \/5, but 
that we do know the Galois group G. In fact, consider any quartic polynomial g(t) 
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with the same Galois group as our example f(t) above; that way we cannot possibly 
know the zeros explicitly. Let them be a, P ,y,5. Consider three subfields of C related 
to a,p,y,8. namely 

QCQ(y,<5) CQ(a,p,y,S) 

Let H = {I,R} C G. Assume that we also know the following two facts: 

(1) The numbers fixed by H are precisely those in Q(y, 8). 

(2) The numbers fixed by G are precisely those in Q. 

Then we can work out how to solve the quartic equation g(t) = 0, as follows. 

The numbers a + j3 and ap are obviously both fixed by H. By fact (1) they lie 
in Q(y, 8). But since 

(f - a)(t - p) = t 2 - (a + p)t + a[5 

this means that a and p satisfy a quadratic equation whose coefficients are in 
Q(y, 5). That is, we can use the formula for solving a quadratic to express a. p 
in terms of rational functions of y and 8. together with nothing worse than square 
roots. Thus we obtain a and p as radical expressions in y and 8. 

But we can repeat the trick to find y and 8. The numbers y+ 8 and y8 are fixed 
by the whole of G: they are clearly fixed by R, and also by S , and these generate 
G. Therefore y 4 8 and yd belong to Q by fact (2) above. Therefore y and 8 satisfy 
a quadratic equation over Q, so they are given by radical expressions in rational 
numbers. Plugging these into the formulas for a and y we find that all four zeros are 
radical expressions in rational numbers. 

We have not found the formulas explicitly. But we have shown that certain in¬ 
formation about the Galois group necessarily implies that they exist. Given more 
information, we can finish the job completely. 

This example illustrates that the subgroup structure of the Galois group G is 
closely related to the possibility of solving the equation g(t) = 0. Galois discovered 
that this relationship is very deep and detailed. For example, the proof that an equa¬ 
tion of the fifth degree cannot be solved by a formula boils down to this: the quintic 
has the wrong sort of Galois group. Galois’s surviving papers do not make this proof 
explicit, probably because he considered the insolubility of the quintic to be a known 
theorem, but it is an easy deduction from results that he does state: see Chapter 25. 

We present a simplified version of this argument, in a restricted setting, in Sec¬ 
tion 8.7. In Section 8.8 we remove this technical restriction using Abel’s classical 
methods. 


8.4 The Abstract Setting 

The modern approach follows Galois closely in principle, but differs in several 
respects in practice. The permutations of a,P,y,8 that preserve all algebraic rela- 
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tions between them turns out to be the symmetry group of the subfield Q(a,J3 , y, 5) 
of C generated by the zeros of g, or more precisely its automorphism group, which 
is a fancy name for the same thing. 

Moreover, we wish to consider polynomials not just with integer or rational co¬ 
efficients, but coefficients that lie in a subfield K of C (or, later, any field). The zeros 
of a polynomial f(t) with coefficients in K determine another field L which contains 
K, but may well be larger. Thus the primary object of consideration is a pair of fields 
K C L, or in a slight generalisation, a field extension L : K. Thus when Galois talks 
of polynomials, the modern approach talks of field extensions. And the Galois group 
of the polynomial becomes the group of /('-automorphisms of L, that is, of bijections 
9 : L — > L such that for all x,y £ L and k £ K 

9(x + y) = 6 (x) 4- 9 (y) 

9(xy) = 9(x)9(y) 

9(k) = k 

Thus the bulk of the theory is described in terms of field extensions and their groups 
of /^-automorphisms. This point of view was introduced in 1894 by Dedekind, who 
also gave axiomatic definitions of subrings and subfields of C. 

The method used above to solve g(t) = 0 relies crucially on knowing the con¬ 
ditions (1) and (2) at the start of Section 8.3. But can we lay hands on that kind of 
information if we do not already know the zeros of gl The answer is that we can— 
though not easily—provided we make a general study of the automorphism groups 
of field extensions, their subgroups, and the subfields fixed by those subgroups. This 
study leads to the Galois correspondence between subgroups of the Galois group and 
sub fields M of L that contain K. Chapters 9-11 set up the Galois correspondence and 
prove its key properties, and the main theorem is stated and proved in Chapter 12. 
Chapter 13 studies one example in detail to drive the ideas home. Chapters 15 and 
18 derive the spectacular consequences for the quintic. Then, starting in Chapter 16, 
we generalise the Galois correspondence to arbitrary fields, and develop the resulting 
theory in several directions. 


8.5 Polynomials and Extensions 

In this section we define the Galois group of a field extension L : K. We begin by 
defining a special kind of automorphism. 

Definition 8.1. Let L.K be a field extension, so that K is a subfield of the subfield 
L of C. A K-automorphism of L is an automorphism a of L such that 

a(k) = k for all k £ K (8.3) 


We say that a fixes k £ K if (8.3) holds. 
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Effectively condition (8.3) makes a an automorphism of the extension L : K, 
rather than an automorphism of the large field L alone. The idea of considering au¬ 
tomorphisms of a mathematical object relative to a sub-object is a useful general 
method; it falls within the scope of the famous 1872 ‘Erlangen Programme’ of Felix 
Klein. Klein’s idea was to consider every ‘geometry’ as the theory of invariants of an 
associated transformation group. Thus Euclidean geometry is the study of invariants 
of the group of distance-preserving transformations of the plane; projective geom¬ 
etry arises if we allow projective transformations; topology comes from the group 
of all continuous maps possessing continuous inverses (called ‘homeomorphisms’ or 
‘topological transformations’). According to this interpretation any field extension is 
a geometry, and we are simply studying the geometrical figures. 

The pivot upon which the whole theory turns is a result which is not in itself 
hard to prove. As Lewis Carroll said in The Hunting of the Snark, it is a ‘maxim 
tremendous but trite’. 

Theorem 8.2. If L : K is afield extension, then the set of all K-automorphisms of L 
forms a group under composition of maps. 

Proof Suppose that a and p are K-automorphisms of L. Then ap is clearly an auto¬ 
morphism; further if k £ K then ap ( k ) = a(k) = k, so that aP is a K-automorphism. 
The identity map on L is obviously a K-automorphism. Finally, OK 1 is an automor¬ 
phism of L, and for any k £ K we have 

k = a~ l a(k) = a^ 1 (k) 

so that a~ l is a K-automorphism. Composition of maps is associative, so the set of 
all K-automorphisms of L is a group. □ 

Definition 8.3. The Galois group T(L : K) of a field extension L.K is the group of 
all K-automorphisms of L under the operation of composition of maps. 

Examples 8.4. (1) The extension C : K. Suppose that a is an K-automorphism of C. 
Let j = a(i) where i = \/—I. Then 

r = (a(0) 2 = «0' 2 ) = a(-i) = -1 

since a(r) = r for all c£K. Hence either j = i or j = —i. Now for any x, y £ K 

a(x + iy) = a(x) + a(i)a(y) =x + jy 

Thus we have two candidates for K-automorphisms: 

cc\ : x + iy x + iy 
a, 2 : x + iy <—> x — iy 

Obviously a\ is the identity, and thus is an K-automorphism of C. The map a .2 is 
complex conjugation, and is an automorphism by Example 1.7(1). Moreover, 


0,2 [x + Oi) = x — 0/ = x 
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so a 2 is an R-automorphism. Obviously ay = a\, so the Galois group r(C:R) is a 
cyclic group of order 2. 

(2) Let c be the real cube root of 2, and consider Q(c) : Q. If a is a Q-automorphism 
of Q(c), then 

(a(c)) 3 = a(c 3 ) = a(2) = 2 

Since Q(c)CK we must have a(c) = c. Hence a is the identity map, and T(Q(c): Q) 
has order 1. 

(3) Let the field extension be Q(v / 2, \/3, s/5) : Q, as in Example 6.8. The analysis 
presented in that example shows that r 2 — 5 is irreducible over Q(y/2, V^). Similarly, 
r 2 — 2 is irreducible over Q(v / 3, x/5) and r 2 — 3 is irreducible over Q(-\/2, s/5). Thus 
there are three Q-automorphisms of Q(-\/2, s/3, \/5), defined by 

p 2 :s/2^,-s/2 s/3 ^ s/3 V5^V5 

py.s/2^s/2 V3h>-\/3 ^ 

p 5 : V2^V2 V3^V3 \/5h> -s/5 

It is easy to see that these maps commute, and hence generate the group Z 2 xZ 2 x Z 2 
Moreover, any Q-automorphism of Q(v / 2, s/3, s/5) must map s/2 K > ±y/2, \/3 i—>• 
±\/3, and \/5 i—> ±\/5 by considering minimal polynomials. All combinations of 
signs occur in the group Z 2 xZ 2 x Z 2 , so this must be the Galois group. 


8.6 The Galois Correspondence 

Although it is easy to prove that the set of all /('-automorphisms of a field exten¬ 
sion L: K forms a group, that fact alone does not significantly advance the subject. To 
be of any use, the Galois group must reflect aspects of the structure of L .K. Galois 
made the discovery (which he expressed in terms of polynomials) that, under certain 
extra hypotheses, there is a one-to-one correspondence between: 

(1) Subgroups of the Galois group of L.K. 

(2) Subfields M of L such that K CM. 

As it happens, this correspondence reverses inclusion relations: larger subfields cor¬ 
respond to smaller groups. First, we explain how the correspondence is set up. 

If L : K is a field extension, we call any field M such that K C M C L ail interme¬ 
diate field. To each intermediate field M we associate the group M* = T( L : M) of all 
M-automorphisms of L. Thus K* is the whole Galois group, and L* = 1 (the group 
consisting of just the identity map on L). Clearly if M C N then M* D N *, because 
any automorphism of L that fixes the elements of N certainly fixes the elements of 
M. This is what we mean by ‘reverses inclusions’. 

Conversely, to each subgroup H of T(L : K ) we associate the set H' of all el¬ 
ements x £ L such that O.(x') = x for all a £ H. In fact, this set is an intermediate 
field: 
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Lemma 8.5. IfH is a subgroup of V(L : K), then H' is a subfield ofL containing K. 


Proof. Let x,y £ H 4 , and a £ H. Then 


a(x+y) = a(x) + a(y) = x+y 


so x+y £ H'. Similarly II is closed under subtraction, multiplication, and division 
(by nonzero elements), so H T is a subfield of L. Since a £ F(L : K) we have a(k) =k 
for all k £ K. so K C H 1 '. □ 


Definition 8.6. With the above notation, II is the fixed field of H. 


It is easy to see that like *, the map f reverses inclusions: if H C G then // ' I) G‘ . 
It is also easy to verify that if M is an intermediate field and H is a subgroup of the 
Galois group, then 


M C M* t 
HCH t* 


(8.4) 


Indeed, every element of M is fixed by every automorphism that fixes all of M , and 
every element of H fixes those elements that are fixed by all of H. Example 8.4(2) 
shows that these inclusions are not always equalities, for there 


Q* t = Q(c) 7^ Q 


If we let & denote the set of intermediate fields, and ( -f the set of subgroups of the 
Galois group, then we have defined two maps 


* 

t 



which reverse inclusions and satisfy equation (8.4). These two maps constitute the 
Galois correspondence between & and Sf. Galois’s results can be interpreted as giv¬ 
ing conditions under which * and ' are mutual inverses, setting up a bijection between 
■f' and Sf. The extra conditions needed are called separability (which is automatic 
over C) and normality. We discuss them in Chapter 9. 

Example 8.7. The polynomial equation 

f(t)=t 4 -4t 2 -5=0 

was discussed in Section 8.2. Its roots are a = i, j3 = —i, y = \/5, 8 = — \/5. The 
associated field extension is L : Q where L = Q (i, \/5). which we discussed in Ex¬ 
ample 4.8. There are four Q-automorphisms of L, namely I. R.S. 7’ where I is the 
identity, and in cycle notation R = (cy.fj ).S = (y5), and T = (cy.fj )(y8 ). Recall that a 
cycle (ai... af) £ S„ is the permutation a such that cr(a ; ) = ay+i when 
<y(ak) = a\, and a(a ) = a when a ^ {ai ,.. .,ak}. Every element of S„ is a product 
of disjoint cycles, which commute, and this expression is unique except for the order 
in which the cycles are composed. 
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In fact I,R,S,T are all possible Q-automorphisms of L, because any Q- 
automorphism must send i to ±i and \/5 to ±\/5- Therefore the Galois group is 

G = {I,R,S,T} 

The proper subgroups of G are 

1 {7,/e} {I,S} {I,T} 

where 1 = {/}. It is easy to check that the corresponding fixed fields are respectively 

L Q(V5) Q(i) ®{iV5) 

Extensive but routine calculations (Exercise 8.2) show that these, together with K , 
are the only subfields of L. So in this case the Galois correspondence is bijective. 


8.7 Diet Galois 

To provide further motivation, we now pursue a modernised version of La¬ 
grange’s train of thought in his memoir of 1770-1771, which paved the way for 
Galois. Indeed we will follow a line of argument that is very close to the work of 
Ruffini and Abel, and prove that the general quintic is not soluble by radicals. Why, 
then, does the rest of this book exist? Because ‘general’ has a paradoxically special 
meaning in this context, and we have to place a very strong restriction on the kind 
of radical that is permitted. A major feature of Galois theory is that it does not as¬ 
sume this restriction. However, quadratics, cubics, and quartics are soluble by these 
restricted types of radical, so the discission here does have some intrinsic merit. It 
could profitably be included as an application in a first course of group theory, or a 
digression in a course on rings and fields. 

We have already encountered the symmetric group S„, which comprises all per¬ 
mutations of the set {1,2,...Its order is n\. When n > 2,§„ has a subgroup of 
index 2 (that is, of order n!/2); namely, the alternating group A„,F which consists 
of all products of an even number of transpositions lab). The elements of A„ are the 
even permutations. The group A„ is a normal subgroup of §„. It is well known that 
A„ is generated by all 3-cycles ( abc): see Exercise 8.7. The group A 5 holds the secret 
of the quintic, as we now explain. 

Introduce the polynomial ring C[fi,... ,f„] in n indeterminates. Let its field of 
fractions be C(ti,...,t„), consisting of rational expressions in the tj. Consider the 
polynomial 

F(t) = (t-t\)...{t-t n ) 

over C(f|,... ,f„), whose zeros are t\ ,... ,f„. Expanding and using induction, we see 
that 


F{t) = t n ~s x t n - 1 +s 2 t"- 2 + ■ ■ ■ + (- 1 )\ 


( 8 . 5 ) 
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where the sj are the elementary symmetric polynomials 

s i = 1 1 + • • • + t n 
S2 = tp2 +fl^3 + ' ' ' + fn-Rn 

S/l = 11 • ■ ■ t n 

Here s r is the sum of all products of r distinct tj. 

The symmetric group S n acts as symmetries of C(fi,... ,t n ): 

&f(t li ■ ■ • Tn) /Ocx(l), • ■ • Tct(m)) 

for / G C(ti,... ,t„). The fixed field /if of §„ consists, by definition, of all symmetric 
rational functions in the tj, which is known to be generated over C by the n elemen¬ 
tary symmetric polynomials in the tj. That is, K = C(si,... ,s n ). Moreover, the Sj 
satisfy no nontrivial polynomial relation: they are independent. There is a classical 
proof of these facts based on induction, using ‘symmetrised monomials’ 

t“'tf 2 ■ ■ ■ tf‘ + all permutations thereof 

and the so-called ‘lexicographic ordering’ of the list of exponents a\,...,a„. See 
Exercise 8.5. A more modern but less constructive proof is given in Chapter 18. 
Assuming that the Sj generate the fixed field, we consider the extension 

C(ti,...,t n ): C(si,...,s„) 

We know that in C(fi,... ,t n ) the polynomial F(t) in (8.5) factorises completely as 

F(t) = (t-t\)...{t-t n ) 

Since the sj are independent indeterminates, F(t) is traditionally called the general 
polynomial of degree n. The reason for this name is that this polynomial has a uni¬ 
versal property. If we can solve F(t) =0 by radicals, then we can solve any specific 
complex polynomial equation of degree n by radicals. Just substitute specific num¬ 
bers for the coefficients Sj. The converse, however, is not obvious. We might be able 
to solve every specific complex polynomial equation of degree n by radicals, but 
using a different formula each time. Then we would not be able to deduce a radi¬ 
cal expression to solve F(t) = 0. So the adjective ‘general’ is somewhat misleading; 
‘generic’ would be better, and is sometimes used. 

The next definition is not standard, but its name is justified because it reflects the 
assumptions made by Ruffini in his attempted proof that the quintic is insoluble. 

Definition 8.8. The general polynomial equation F(t) = 0 is soluble by Ruffini rad¬ 
icals if there exists a finite tower of subfields 

C(si,... ,s n ) = Kq C K\ C • • • C K r = C(fi,... ,t„) 

such that for j = 1,..., r, 

Kj = Kj-i((Xj) and € Kj for nj > 2, nj G N 


( 8 . 6 ) 
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The aim of this definition is to exclude possibilities like the \/—121 in Cardano’s 
solution (1.10) of the quartic equation r 4 — 15r — 4 = 0, which does not lie in the field 
generated by the roots, but is used to express them by radicals. 

Ruffini tacitly assumed that if F(t) = 0 is soluble by radicals, then those radicals 
are all expressible as rational functions of the roots t\.... ,t„. Indeed, this was the sit¬ 
uation studied by his predecessor Lagrange in his deep but inconclusive researches 
on the quintic. So Lagrange and Ruffini considered only solubility by Ruffini radi¬ 
cals. However, this is a strong assumption. It is conceivable that a solution by radi¬ 
cals might exist, for which some of the 0Cj constructed along the way do not lie in 
C(t \.... ,t n ), but in some extension of C(fi,... ,t„). For example, might be use¬ 
ful. (It is useful to solve t 5 — s\ = 0, for instance, but the solutions of this equation 
do not belong to C(fi,However, the more we think about this possibility, the 
less likely it seems. Abel thought about it very hard, and proved that if F(t) =0 is 
soluble by radicals, then those radicals are all expressible in terms of rational func¬ 
tions of the roots—they are Ruffini radicals after all. This step, historically called 
‘Abel’s Theorem’, is more commonly referred to as the ‘Theorem on Natural Irra¬ 
tionalities’. From today’s perspective, it is the main difficulty in the impossibility 
proof. So, following Lagrange and Ruffini, we start by defining the main difficulty 
away. In compensation, we gain excellent motivation for the remainder of this book. 

For completeness, we prove the Theorem on Natural Irrationalities in Section 8.8, 
using classical (pre-Galois) methods. As preparation for all of the above, we need: 

Proposition 8.9. If there is a finite tower of subfields (8.6), then it can be refined (if 
necessary increasing its length) to make all ti j prime. 

Proof. For fixed j write nj = pi...pk where the pj are prime. Let /3/ = 0 Cj l+1 ' Pk , for 
0 < l < k. Then fio £ Kj and f5j > ' £ Kj(f3/-i), and the rest is easy. □ 

For the remainder of this chapter we assume that this refinement has been per¬ 
formed, and write pj for nj as a reminder. With this preliminary step completed, we 
will prove: 

Theorem 8.10. The general polynomial equation F(t ) = 0 is insoluble by Ruffini 
radicals ifn > 5. 

All we need is a simple group-theoretic lemma. 

Lemma 8.11. (1) The symmetric group §„ has a cyclic quotient group of prime order 
p if and only if p = 2 and n > 2, in which case the kernel is the alternating group A„. 
(2) The alternating group A„ has a cyclic quotient group of prime order p if and only 
if p = 3 and n = 3,4. 

Proof. (1) We may assume n> 3 since there is nothing to prove when n = 1,2. 
Suppose that A is a normal subgroup of §„ and § „/N = Z p . Then S n /N is abelian, 
so N contains every commutator ghg~ 1 h~ 1 forg,/z £ S„. To see why, let g denote the 
image of g £ S„ in the quotient group §„/A. Since E>„/N is abelian, ghg~ l h~ l = 1 in 
S n/N\ that is, ghg~ l h~ l £ N. 
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Let gfi be 2-cycles of the form g = (ab)fi = ( ac ) where a,b,c are distinct. Then 

ghg~ l h~ l = ( bca ) 

is a 3-cycle, and all possible 3-cycles can be obtained in this way. Therefore N con¬ 
tains all 3-cycles. But the 3-cycles generate A„, so A' I) A„. Therefore p - 2 since 

|S„/A„|=2. 

(2) Suppose that IV is a normal subgroup of A„ and A n /N = Z p . Again, N contains 
every commutator. If n = 2 then A„ is trivial. When n = 3 we know that A„ = Z 3 . 

Suppose first that n = 4. Consider the commutator glig 1 h 1 where g = 
( abc)fi = (aM) for a,b,c,d distinct. Computation shows that 

ghg~ l h~ l = ( ab)(cd) 

so A must contain (12)(34), (13)(24), and (14)(23). It also contains the identity. But 
these four elements form a group V. Thus VCjV. Since V is a normal subgroup of 
A 4 and A 4 /V = Z 3 , we are done. 

The symbol V comes from Klein’s term Vierergruppe, or ‘fours-group’. Nowa¬ 
days it is usually called the Klein four-group. 

Finally, assume that n > 5. The same argument shows that N contains all per¬ 
mutations of the form ( ab)(cd). If a,b,c,d,e are all distinct (which is why the case 
n = 4 is special) then 

( 1 ab)(cd ) • ( ab)(ce ) = ( ced ) 

so N contains all 3-cycles. But the 3-cycles generate A„, so this case cannot occur. 

□ 


As our final preparatory step, we recall the expression (1.13) 

S = f[( t j- t k) 

j<k 

It is not a symmetric polynomial in the tj, but its square A = r) 2 is, because 

A=( —!)»(«- D / 2 pi {tj - tk) 

The expression A, mentioned in passing in Section 1.4, is called the discriminant of 
Ff). If a £ then the action of a sends 8 to ±5. The even permutations (those 
in A„) fix 8 , and the odd ones map 8 to 8. Indeed, this is a standard way to define 
odd and even permutations. 

We are now ready for the: 

Proof of Theorem 8.10 

Assume that F(t) = 0 is soluble by Ruffini radicals, with a tower (8.6) of sub¬ 
fields Kj in which all nj = p/ are prime. Let K = C(si, ... ,s„) and L = C(fi, ... ,t n ). 
Consider the first step in the tower. 


KCKi<ZL 
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where K\ = K(a.\). a[' £ K. ci\ K, and p = p\ is prime. 

Since (X\ £ L we can act on it by §„, and since every <7 £ S„ fixes K we have 

( <7 ( 0£ i)) P = a i 

Therefore <j(ai) = (X \, for C, a primitive pth root of unity and j(a) an integer 

between 0 and p — 1. The set of all pth roots of unity in C is a group under multipli¬ 
cation, and this group is cyclic, isomorphic to Z p . Indeed = £ a+h where a + b 
is taken modulo p. 

Clearly the map 


j ■ H► Z p 

a ^ j(a) 

is a group homomorphism. Since a\ K, some cr(ai) / «|, so j is nontrivial. Since 
Z p has prime order, hence no nontrivial proper subgroups, j must be onto. Therefore 
§„ has a homomorphic image that is cyclic of order p. By Lemma 8.11, p = 2 and 
the kernel is A„. Therefore c/.\ is fixed by A„. 

We claim that this implies that ai £ K ( 8). Since p = 2, the relation a[’ £ K 
becomes a^ £ K, so ai is a zero of t 2 — aj £ K[t}. The images of a.\ under S n must 
all be zeros of this, namely ±(t \. Now a\ is fixed by A„ but not by S„, so some 
permutation a £ S„ \A„ satisifes cr(ai) = —a\. Then 8a\ is fixed by both A„ and 
cr, hence by §„. So (5ai S K and a\ £ K(8). 

If n = 2 we are finished. Otherwise consider the second step in the tower 

K(8)CK 2 =K(8){a 2 ) 

By a similar argument, a 2 defines a group homomorphism j : A„ —> Z p , which again 
must be onto. By Lemma 8.11, p = 3 and n = 3,4. In particular, no tower of Ruffini 
radicals exists when n> 5. □ 

It is plausible that any tower of radicals that leads from C(si,..., ,v„) to a subfield 
containing C(fi,... ,t„) must give rise to a tower of Ruffini radicals. However, it is 
not at all clear how to prove this, and in fact, this is where the main difficulty of the 
problem really lies, once the role of permutations is understood. Ruffini appeared not 
to notice that this needed proof. Abel tackled the obstacle head on. 

Galois worked his way round it, by way of the Galois group—an extremely el¬ 
egant solution. The actual details of his work differ considerably from the modern 
presentation, see Neumann (2011), both notationally and strategically. However, the 
underlying idea of studying what we now interpret as the symmetry group of the 
polynomial, and deriving properties related to solubility by radicals, is central to Ga¬ 
lois’s approach. His method also went much further: it applies not just to the general 
polynomial F(t), but to any polynomial whatsoever. And it provides necessary and 
sufficient conditions for solutions by radicals to exist. 

Exercises 8.9-8.11 provide enough hints for you to show that when n = 2,3,4 
the equation F(t) = 0, where F is defined by (8.5), can be solved by Ruffini radi¬ 
cals. Therefore, despite the special nature of Ruffini radicals, we see that the quintic 
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equation differs (radically) from the quadratic, cubic, and quartic equations. We also 
appreciate the significant role of group theory and symmetries of the roots of a poly¬ 
nomial for the existence—or not—of a solution by radicals. This will serve us in 
good stead when the going gets tougher. 


8.8 Natural Irrationalities 

With a little more effort we can go the whole hog. Abel’s proof contains one fur¬ 
ther idea, which lets us delete the word ‘Ruffini’ from Theorem 8.10. This section is 
an optional extra, and nothing later depends on it. We continue to work with the gen¬ 
eral polynomial, so throughout this section L = C(/i,... ,t n ) and K = C(si,... ,s„), 
where the sj are the elementary symmetric polynomials in the tj. 

To delete ‘Ruffini’ we need: 

Definition 8.12. An extension L : K in C is radical if L = K(tt\ ..... a m ) where for 
each j — 1 ,... ,m there exists an integer n ,■ such that 

a" J {j> 2) 

The elements ctj form a radical sequence for L : K. The radical degree of the radical 
ccj is tij. 

The essential point is: 

Theorem 8.13. If the general polynomial equation F(t) = 0 can be solved by radi¬ 
cals, then it can be solved by Ruffini radicals. 

Corollary 8.14. The general polynomial equation F(t ) = 0 is insoluble by radicals 
if n > 5. 

To prove the above, all we need is the so-called ‘Theorem on Natural Irrational¬ 
ities’, which states that extraneous radicals like Jfs\ cannot help in the solution of 
F(t) = 0. More precisely: 

Theorem 8.15 (Natural Irrationalities). IfL contains an element x that lies in some 
radical extension R ofK, then there exists a radical extension R' ofK with x £ R' and 
R’ CL. 

Once we have proved Theorem 8.15, any solution of F(t ) = 0 by radicals can 
be converted into one by Ruffini radicals. Theorem 8.13 and Corollary 8.14 are then 
immediate. 

It remains to prove Theorem 8.15. A proof using Galois theory is straightforward, 
see Exercise 15.11. With what we know at the moment, we have to work a little 
harder—but, following Abel’s strategic insights, not much harder. We need several 
lemmas, and a technical definition. 
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Definition 8.16. Let R : K be a radical extension. The height of R : K is the smallest 
integer h such that there exist elements 0£i,...,0£/, € R and primes pi,...,p/, such 
that R = K{a \,..., a/,) and 

a p d e K(ai,...,aj-i) 1 <j<h 

where when j = 1 we interpret K(dCi,... ,CCj-i) as K. 

Proposition 8.9 shows that the height of every radical extension is defined. 

We prove Theorem 8.15 by induction on the height of a radical extension R that 
contains x. The key step is extensions of height 1, and this is where all the work is 
put in. 

Lemma 8.17. Let M be a subfield of L such that K C M, and let a € M, where a is 
not a pth power in M. Then 

(1) a k is not a pth power in M for k = 1,2,...,/? — 1. 

(2) The polynomial m(t ) = t p — a is irreducible over M. 

Proof. (1) Since k is prime to p there exist integers qf such that qp + lk = 1. If 
a k = b p with b € M, then 

(. a q b l ) p = a qp b lp = a qp a kl = a 


contrary to a not being a pth power in M. 

(2) Assume for a contradiction that t p — a is reducible over M. Suppose that Pit) 
is a monic irreducible factor of in(t') =t p — a over M. For 0 < j < p — 1 let Pj(t) = 
P( Qt), where tf G C C K C M is a primitive pth root of unity. Then If = P , and P, is 
irreducible for all for if P(C k t) = g(t)h(t) then P(t) = g(^~h)h(^~H). Moreover, 
m(£H) = (£ h) p — a = t p — a = m{t), so Pj divides m for all j = 0,... ,p — 1 by 
Lemma 5.6. 

We claim that If and If are coprime whenever 0<j<k<p — 1. If not, by 
irreducibility 

Pj(t) = cP k {t) cGM 

Let 

P{t) = P0 + Plf + ' ' ' + Pr- \t f * 
where r < p. By irreducibility, po f 0. Then 

Pj(t) =P0 + PxVt + • • • + Pr-lC, i[r ~ l) t r ~ l + C ^ 

p k (t) = p 0 +p 1 ^ + ---+P,-iC Mr “ 1 V“ 1 + CV 


so c = Cf G k i r from the coefficient of t r . But then po = (J'^i'po. Since po f 0, we 
must have = 1, so r = p. But this implies that dP = dm. so m is irreducible 

over M. 

Thus we may assume that the Pj are pairwise coprime. We know that Pfm for all 
j, so 

P 0 Pi ...Pp- 11 m 
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Since dp = r , it follows that pr < p, so r = 1. Thus P is linear, so there exists b £ M 
such that ( t — b)\m(t). But this implies that b p = a, contradicting the assumption that 
a is not a pth power. Thus t p — a is irreducible. □ 

Now suppose that R is a radical extension of height 1 over M. Then R = M(a) 
where a p £ M, a £M. Therefore every x £ R\M is uniquely expressible as 

x = xo+xia+X 2 (x 2 -i — x p -\a p ~ l (8.7) 

where the xj £ M.This follows since [M(a) : M] = p by irreducibility of m. We want 
to put x into a more convenient form, and for this we need the following result: 

Lemma 8.18. Let L C M be fields, and let p be a prime such that L contains a 
primitive pth root of unity £. Suppose that a,x o,...,x„_i £ M with a^O, and L 
contains all of the elements 

X r =x 0 + (C r a)x l + (C r a) 2 x 2 ^ - 1 - (C'a) p ~ l x P -i ( 8 . 8 ) 

for 0 < r < p — 1. Then each of the elements xq, OCX i,OC 2 x 2 , ..., CC p ~ 1 x p -1 also lies in 
L. Hence, if x i = 1, then a and each Xj (0 < j < p — 1) lies in L. 

Proof For any m with 0 < m < p — 1, consider the sum 

x 0 +r m *i+r 2 ”%+• • •+ c~ {p ~ l)m x P -1 

Since 1 + £ + £ 2 +-b £ p ~ l = 0, all terms vanish except for those in which the 

power of C is zero. These terms sum to pa'"x m . Therefore pa'"x m £ L, so cc m x m £ L. 

If V| = 1 then the case m = 1 shows that a £ L, so now x m £ L for all m with 
0 < m < p — 1 . □ 


We can also prove: 

Lemma 8.19. With the above notation, for a given x £ R, there exist ji £ M(a) and 
b £ M with b = fi p , such that b is not the pth power of an element ofM, and 

x = yo + fi +y2j3 2 + ---yp-ij3 p “ 1 

where the }’; £ M. 

Proof We know that x f M , so in (8.7) some f 0 for 1 <s< p — 1. Let j3 = x s a s , 
and let b = fi 1 '. Then b = x§a sp = xfa s , and if b is a pth power of an element of M 
then a s is a pth power of an element of M, contrary to Lemma 8.17(2). Therefore b 
is not the pth power of an element of M. 

Now s is prime to p, and the additive group Z p is cyclic of prime order p, so s 
generates Z p . Therefore, up to multiplication by nonzero elements of M, the powers 
p J of p run through the powers of a precisely once as j runs from 0 to p — 1. Since 
J3° = 1 ,P 1 = we have 

x = y 0 + P +y2j3 2 H- hyp-ifi 1 ’” 1 


for suitable yj £ M , where in fact yo = jcq. 


□ 
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Lemma 8.20. Let q £ L. Then the minimal polynomial of q over K splits into linear 
factors over L. 

Proof The element q is a rational expression q{t\,...,t n ) £ C(fi,... ,t n ). The poly¬ 
nomial 

fq(t) [| 4 ■ ■ ■ Tcr(n))) 

has q as a zero. Symmetry under S n implies that f q (t) £ K\t\. The minimal polyno¬ 
mial m q of q over K divides f q , and f q is a product of linear factors; therefore m q is 
the product of some subset of those linear factors. □ 

We are now ready for the climax of Diet Galois: 

Proof of Theorem 8.15. We prove the theorem by induction on the height h of R. 

If /z = 0 then the theorem is obvious. 

Suppose that h > 1. Then R = R \ ( a j where If is a radical extension of K of 
height h— 1, and c/J' £ R\. a f R\. with p prime. Let a p = a £ R\. 

By Lemma 8.19 we may assume without loss of generality that 

x = xo + OL +X2 & 2 H-hxp-ia p ~ 1 

where the xj £R\. (Replace a by as in the lemma, and then change notation back 
to a.) The mimimum polynomial m(t) of x over K splits into linear factors in L by 
Lemma 8.20. In particular, x is a zero of m{f), while all zeros of m(t) lie in L. 

Take the equation m(a ) = 0, write x as above in terms of powers of a with 
coefficients in R\, and consider the result as an equation satisfied by a. The equation 
has the form f(a) = 0 where /(f) £ R\ [f]. Therefore /(f) is divisible by the minimal 
polynomial of a , which is t p — a. Hence all the roots of that equation, namely Cf CL 
for 0 < r < p — 1, are also roots of /(f). Therefore all the elements X r in (8.8) are 
roots of m(f), so they lie in L. Lemma 8.18 now shows that a,xo,X 2 ,.. .x p _i £ L. 

Also, a p ,x o,x' 2 ,.. .x p -1 £ R]. The height of R\ is /z — 1, so by induction, each of 
these elements lies in some radical extension of K that is contained in L. The subfield 
J generated by all of these radical extensions is clearly radical (Exercise 8.12), and 
contains a p ,xo,X 2 , ■ ■ .x„_i. Then x £ J (a) C L , and J(a) is radical. This completes 
the induction step, and with it, the proof. □ 

So much for the general quintic. We have used virtually everything that led up to 
Galois theory, but instead of thinking of a group of automorphisms of a field exten¬ 
sion, we have used a group of permutations of the roots of a polynomial. Indeed, we 
have used only the group §„, which permutes the roots tj of the general polynomial 
F(t). It would be possible to stop here, with a splendid application of group theory 
to the insolubility of the ‘general’ quintic. But for Galois, and for us, there is much 
more to do. The general quintic is not general enough, and it would be nice to find 
out why the various tricks used above actually work. At the moment, they seem to 
be fortunate accidents. In fact, they conceal an elegant theory (which, in particular, 
makes the Theorem on Natural Irrationalities entirely obvious; so much so that we 
can ignore it altogether). That theory is, of course, Galois theory. Now motivated up 
to the hilt, we can start to develop it in earnest. 
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EXERCISES 

8.1 Prove that in Section 8.2, the permutations R and S of equations (8.1, 8.2) pre¬ 
serve every valid polynomial equation over Q relating a, /j, y, and 8. (Hint: 
The permutation R has the same effect as complex conjugation. For the permu¬ 
tation S , observe that any polynomial equation in a. p. y. <5 can be expressed 
as 

PY + qS = 0 

where p,q £ Q(i). Substitute y= \/5 ,8 = —V5 to derive a condition on p and 
q. Show that this condition also implies that the equation holds if we change 
the values so that y = —y/5,8 = \/5.) 

8.2 Show that the only subfields of Q(i, \/5) are Q, Q(i), Q(\/5), Q(iV5), and 

Q(i,v 5). 

8.3 Express the following in terms of elementary symmetric polynomials of 
a, j8, y. 

(a) a 2 +fi 2 + y 2 

(b) a 3 + j3 3 + y 3 

(c) a 2 P + a 2 y+P 2 a + p 2 y+Y 2 a + Y 2 p 

(d) (a-/3) 2 -f-Q3 -y) 2 + (y-a) 2 

8.4 Prove that every symmetric polynomial p{x,y) £ Q[x,y] can be written as a 
polynomial in xy and x+y, as follows. If p contains a term ax'yP with i y j £ N 
and a £ Q, show that it must also contain the term ax J y‘. Use this to write p as 
a sum of terms of the form a(x ! yJ + x J 'y) or ax'y'. Observe that 

x'yi +x 2 y‘ = x l y l {x 2 ~ l +y 2 ~‘) if i < j 

*y = (xy)' 

(x'+y) = {x+y) (x ‘~ 1 + y i_ 1 ) — xy (x ,_2 + y '~ 2 ). 

Hence show that p is a sum of terms that are polynomials in x+y,xy. 

8.5* This exercise generalises Exercise 8.3 to n variables. Suppose that 
p(t\ ,...,t n ) £ K[t\,...,t n \ is symmetric and let the Sj be the elementary sym¬ 
metric polynomials in the tj. Define the rank of a monomial t^t^ 2 ■ ■ -t„" to be 

a i + 2ai H- na„- Define the rank of p to be the maximum of the ranks of all 

monomials that occur in p , and let its part of highest rank be the sum of the 
terms whose ranks attain this maximum value. Find a polynomial q composed 
of terms of the form ks b p s b -, 2 ... s b ", where k£K , such that the part of q of high¬ 
est rank equals that of p. Observe that p — q has smaller rank than p, and use 
induction on the rank to prove that p is a polynomial in the s,. 
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OoAa + <JiAa+ 1 H-b^n-l^A+n-l + fln^-A+n =0 (A > 1) 

Show how to use these identities inductively to obtain formulas for the Ay. 

8.7 Prove that the alternating group A„ is generated by 3-cycles. 

8.8 Prove that every element of A 5 is the product of two 5-cycles. Deduce that A 5 
is simple. 

8.9 Solve the general quadratic by Ruffini radicals. (Hint: If the roots are a 1 , 1 X 2 , 
show that cq — cq is a Ruffini radical.) 

8.10 Solve the general cubic by Ruffini radicals. (Hint: If the roots are a. \ . 0 . 2 . oq, 
show that cq + wcq + (era?, and cq + to 1 0.2 + CQCC 3 are Ruffini radicals.) 

8 .11 Suppose that I C J are subfields of C(q, ... ,tn) (that is, subsets closed under 

the operations , x, -=-), and J is generated by J\ .7, where I C Jj C J 

for each j and Jj : I is radical. By induction on r, prove that J : I is radical. 

8.12 Mark the following true or false. 

(a) The /('-automorphisms of a field extension L : K form a subfield of C. 

(b) The A-automorphisms of a field extension L : K form a group. 

(c) The fixed field of the Galois group of any finite extension L : K contains 
K. 

(d) The fixed field of the Galois group of any finite extension L: K equals K. 

(e) The alternating group A 5 has a normal subgroup H with quotient isomor¬ 
phic to Z 5 . 

(f) The alternating group A 5 has a normal subgroup H with quotient isomor¬ 
phic to Z 3 . 
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(g) The alternating group A 5 has a normal subgroup H with quotient isomor¬ 
phic to Z 2 . 

(h) The general quintic equation can be solved using radicals, but it cannot 
be solved using Ruffini radicals. 
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Chapter 9 

Normality and Separability 


In this chapter we define the important concepts of normality and separability for 
field extensions, and develop some of their key properties. 

Suppose that K is a subfield of C. Often a polynomial p(t) £ K\t\ has no zeros 
in K. But it must have zeros in C, by the Fundamental Theorem of Algebra, Theo¬ 
rem 2.4. Therefore it may have at least some zeros in a given extension field L of K. 
For example t 2 + 1 £ R[f] has no zeros in R, but it has zeros ±i £ C, in Q (i), and for 
that matter in any subfield containing Q(i). We shall study this phenomenon in detail, 
showing that every polynomial can be resolved into a product of linear factors (and 
hence has its full complement of zeros) if the ground field K is extended to a suitable 
‘splitting field’ N , which has finite degree over K. An extension N : K is normal if 
any irreducible polynomial over K with at least one zero in N splits into linear factors 
in N. We show that a finite extension is normal if and only if it is a splitting field. 

Separability is a complementary property to normality. An irreducible polyno¬ 
mial is separable if its zeros in its splitting field are simple. It turns out that over C, 
this property is automatic. We make it explicit because it is not automatic for more 
general fields, see Chapter 16. 


9.1 Splitting Fields 

The most tractable polynomials are products of linear ones, so we are led to single 
this property out: 

Definition 9.1. If K is a subfield of C and / is a nonzero polynomial over K , then / 
splits over K if it can be expressed as a product of linear factors 

f{t) = k(t-a l )...(t-a n ) 


where k, cc \,..., a„ £ K. 

If this is the case, then the zeros of / in K are precisely cq,..., a„. The Funda¬ 
mental Theorem of Algebra, Theorem 2.4, implies that / splits over K if and only if 
all of its zeros in C actually lie in K. Equivalently, K contains the subfield generated 
by all the zeros of /. 
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Examples 9.2. (1) The polynomial f(t) = f 3 1 € Q[f] splits over C, because it can 
be written as 

f(t) = (t~l)(t-co)(t-co 2 ) 

where ft) = e 2,r '/ 3 € C. Similarly, / splits over the subfield Q(i, a/ 3) since ft) € 
Q(i,V 3), and indeed / splits over Q(ft>), the smallest subfield of C with that prop¬ 
erty. 

(2) The polynomial /(f) = t 4 — 4t 2 — 5 splits over Q(i, s/5). because 
/(f) = (f-))(f + ))(f-v / 5)(f + v / 5) 

However, over Q(i) the best we can do is factorise it as 

(t — i)(t + i)(t 2 — 5) 

with an irreducible factor t 2 — 5 of degree greater than 1. (It is easy to show that 5 is 
not a square in Q(i).) 

So over Q(f), the polynomial / does not split. This shows that even if a poly¬ 
nomial /(f) has some linear factors in an extension field L, it need not split over 
L. 


If / is a polynomial over K and L is an extension field of K, then / is also a 
polynomial over L. It therefore makes sense to talk of / splitting over L, meaning 
that it is a product of linear factors with coefficients in L. We show that given K 
and / we can always construct an extension E of K such that / splits over E. It is 
convenient to require in addition that / does not split over any smaller field, so that 
E is as economical as possible. 

Definition 9.3. A subfield E of C is a splitting field for the nonzero polynomial / 
over the subfield K of C if K C E and 

(1) / splits over E. 

(2) If K C E' C E and / splits over E' then E' = E. 

The second condition is clearly equivalent to: 

(2') E = K(<J\ ( 7,1 ) where <n,..., <7„ are the zeros of / in E. 

Clearly every polynomial over a subfield K of C has a splitting field: 

Theorem 9.4. IfK is any subfield ofC and f is any nonzero polynomial over K, then 
there exists a unique splitting field T.for f over K. Moreover, [E : K] is finite. 

Proof. We can take E = K(a\ _, ( 7 „), where the < 7 , are the zeros of / in C. In 

fact, this is the only possibility, so E is unique. The degree [E : K] is finite since 
K(a u ...,O n ) is finitely generated and algebraic, so Lemma 6.11 applies. □ 

Isomorphic subfields of C have isomorphic splitting fields, in the following 
strong sense: 
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Lemma 9.5. Suppose that i : K —> K' is an isomorphism of subfields of C. Let f be 
a nonzero polynomial over K and letl.f> K be the splitting field for f. Let L be any 
extension field ofK' such that i{f ) splits over L. Then there exists a monomorphism 
j : E —> L such that j \k = l. 

Proof We have the following situation: 

K —> E 

1 4- 4 -j 

K' -s- L 

where j has yet to be found. We construct j using induction on df. As a polynomial 
over E, 

f(t) = k(t-a x )...(t-a n ) 

The minimal polynomial m of G\ over K is an irreducible factor of /. Now I (m) 
divides i (/) which splits over L, so that over L 

l(rn) = (t— a r ) 

where cq,..., a r £ L. Since I (m) is irreducible over K' it must be the minimal poly¬ 
nomial of a,i over K'. So by Theorem 5.16 there is an isomorphism 

ji ■ K(oi) ^K'(ai) 

such that ji\x = t and j\{ 0 \) = a\. Now E is a splitting field over Tf((7i) of the 
polynomial g = f/(t — o i). By induction there exists a monomorphism j : E — > L 
such that y'lx-tffj) = j\. But then j \k = i and we are finished. □ 

This enables us to prove the uniqueness theorem. 

Theorem 9.6. Let i : K —> K' be an isomorphism. Let E be the splitting field for f 
over K, and letL' be the splitting field for l(f) over K'. Then there is an isomorphism 
j : E —> TJ such that j\k = l. In other words, the extensions E : K and T! : K' are 
isomorphic. 

Proof. Consider the following diagram: 

K -y E 

1 4- 4-7 

K' -> E' 

We must find j to make the diagram commute, given the rest of the diagram. By 
Lemma 9.5 there is a monomorphism j : E —> E' such that j \k = l. But /(E) is clearly 
the splitting field for i{f) over K', and is contained in E'. Since E' is also the splitting 
field for i (/) over K', we have j( E) = E', so that j is onto. Hence j is an isomorphism, 
and the theorem follows. □ 
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Examples 9.7. (1) Let f(t) = ( t 2 — 3)(f 3 + 1) over Q. We can construct a splitting 
field for / as follows: over C the polynomial / splits into linear factors 


f(t) = (t + V3)(t-V3)(t+l) 


1 +iVl\ 
2 ) 


1 - jV3 \ 
2 ) 


so there exists a splitting field in C, namely 



l+iV3\ 
2 ) 


This is clearly the same as Q(\/3,i). 

(2) Let f{t) = ( t 2 — 2f — 2)(f 2 + 1) over Q. The zeros of / in C are 1 ± y/3, ±/, so 
a splitting field is afforded by Q(1 +y/3,i) which equals Q(\/3 ,i). This is the same 
field as in the previous example, although the two polynomials involved are different. 

(3) It is even possible to have two distinct irreducible polynomials with the same 
splitting field. For example t 2 — 3 and t 2 — 2t — 2 are both irreducible over Q, and 
both have Q(\/3) as their splitting held over Q. 


9.2 Normality 

The idea of a normal extension was explicitly recognised by Galois (but, as always, 
in terms of polynomials over C). In the modern treatment it takes the following form: 

Definition 9.8. An algebraic held extension L : K is normal if every irreducible 
polynomial / over K that has at least one zero in L splits in L. 

For example, C : K is normal since every polynomial (irreducible or not) splits 
in C. On the other hand, we can find extensions that are not normal. Let a be the real 
cube root of 2 and consider Q(a) : Q. The irreducible polynomial t '' - 2 has a zero, 
namely a, in Q(a), but it does not split in Q(oc). If it did, then there would be three 
real cube roots of 2, not all equal. This is absurd. 

Compare with the examples of Galois groups given in Chapter 8 . The normal 
extension C : R has a well-behaved Galois group, in the sense that the Galois cor¬ 
respondence is a bijection. The same goes for Q(v / 2, \/3, x/5) : Q. In contrast, the 
non-normal extension Q(a) : Q has a badly behaved Galois group. Although this is 
not the whole story, it illustrates the importance of normality. 

There is a close connection between normal extensions and splitting fields which 
provides a wide range of normal extensions: 

Theorem 9.9. Afield extension L: K is normal and finite if and only ifL is a splitting 
field for some polynomial over K. 
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Proof. Suppose L.K is normal and finite. By Lemma 6.11, L = K(a ,\,..., a s ) for 
certain a, algebraic over K. Let m ; be the minimal polynomial of over K and let 
/ = m\.. ,m s . Each ntj is irreducible over K and has a zero a 7 - £ L, so by normality 
each nij splits over L. Hence / splits over L. Since L is generated by K and the zeros 
of /, it is the splitting field for / over K. 

To prove the converse, suppose that L is the splitting field for some polynomial g 
over K. The extension L.K is then obviously finite; we must show it is normal. To 
do this we must take an irreducible polynomial / over K with a zero in L and show 
that it splits in L. Let MD1 be a splitting field for fg over K. Suppose that 6 \ and 
02 are zeros of / in M. By irreducibility, / is the minimal polynomial of 9\ and 02 
over K. 

We claim that 

[L(0j): L] = [L(0 2 ): L] 

This is proved by an interesting trick. We look at several subfields of M. namely 
K ,L,K{0\) ,L(9\) ,K(9 2 ) ,L(0 2 ). There are two towers 

KCK(Qi) CL(0i) CM 
KCK(9 2 ) CL(0 2 ) CM 

The claim will follow from a simple computation of degrees. For j = 1 or 2 

[L(0,): L][L : K\ = [L(9j ): K\ = [L{Qj ): *(0,-)][*(0,-): K\ (9.1) 

By Proposition 6.7, \K (0]) : K] = [/('('0 2 ) : K\. Clearly L(6j) is the splitting field for 
g over K(9j), and by Corollary 5.13 K{9\ ) is isomorphic to K{9 2 ). Therefore by 
Theorem 9.6 the extensions L(9j) : K(9j) are isomorphic for j = 1,2, so they have 
the same degree. Substituting in (9.1) and cancelling, 

[L(0 1 ):L] = [L(0 2 ):L] 

as claimed. From this point on, the rest is easy. If 0i € L then [L(0i) : L] = 1, so 
[L{9 2 ): L] = 1 and 9 2 £ L also. Hence L.K is normal. □ 


9.3 Separability 

Galois did not explicitly recognise the concept of separability, since he worked only 
with the complex field, where, as we shall see, separability is automatic. However, 
the concept is implicit in his work, and must be invoked when studying more general 
fields. 

Definition 9.10. An irreducible polynomial / over a subfield K of C is separable 
over K if it has simple zeros in C, or equivalently, simple zeros in its splitting field. 
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This means that over its splitting field, or over C, / takes the form 
f(t)=k(t-Oi)...(t-o„) 
where the C 7 ; are all different. 

Example 9.11. The polynomial f 4 +t 2 +t 2 +t +1 is separable over Q, since its zeros 
in C are e 2m / 5 , e 4m / 5 , e 671 '/ 5 , e 87r '/ 5 , which are all different. 

For polynomials over R. there is a standard method for detecting multiple zeros 
by differentiation. To obtain maximum generality later, we redefine the derivative in 
a purely formal manner. 

Definition 9.12. Suppose that K is a subfield of C, and let 
fit') — qq a\t a n t n £ K [t] 

Then the, formal derivative of / is the polynomial 

D f = a \ T 2ait T * * * T na n t n 4 £ K [t] 

For K = R (and indeed for K = C) this is the usual derivative. Several useful 
properties of the derivative carry over to D. In particular, simple computations (Ex¬ 
ercise 9.3) show that for all polynomials / and g over K, 

D(f + g) = Df+Dg 
D(fg ) = 0 Df)g+f(Dg) 

Also, if A £ K then D{ A) =0, so 

D(A/)=A(D/) 

These properties of D let us state a criterion for the existence of multiple zeros 
without knowing what the zeros are. 

Lemma 9.13. Let f / 0 be a polynomial over a subfield K of C, and let E be its 
splitting field. Then f has a multiple zero (in C or E) if and only if f and D f have a 
common factor of degree > 1 in K[t], 

Proof Suppose / has a repeated zero in E, so that over E 

fit) = it-a) 2 g(t) 

where a £ E. Then 

Df=(t-a)[(t-a)Dg + 2g\ 

so / and Df have a common factor (t — a) in E[f]. Hence / and Df have a common 
factor in K[t\, namely the minimal polynomial of a over K. 

Now suppose that / has no repeated zeros. Suppose that / and Df have a common 
factor, and let a be a zero of that factor. Then / = (t — a)g and Df = (t — a)h. 
Differentiate the former to get (t cx)h -- Df = g + {t — Ot.)Dg, so (t — a) divides g, 
hence ( t — a ) 2 divides /. □ 
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We now prove that separability of an irreducible polynomial is automatic over 
subfields of C. 

Proposition 9.14. If K is a subfield ofC then every irreducible polynomial over K 
is separable. 

Proof. An irreducible polynomial / over K is inseparable if and only if / and Df 
have a common factor of degree > 1. If so, then since / is irreducible the common 
factor must be /, but Df has smaller degree than /, and the only multiple of / having 
smaller degree is 0, so Df = 0. Thus if 

fit) = ao H- \-a m t"' 

then this is equivalent to na„ = 0 for all integers n > 0. For subfields of C, this is 
equivalent to a n = 0 for all n > 0. □ 


EXERCISES 

9.1 Determine splitting fields over Q for the polynomials f 3 — 1 .t 4 +5t 2 + 6 .t () — 8 , 
in the form Q(ai,..., af) for explicit a,. 

9.2 Find the degrees of these fields as extensions of Q. 

9.3 Prove that the formal derivative D has the following properties: 

(a) D(f + g)=Df+Dg 

(b) D(fg) = ( Df)g + f(Dg) 

(c) If/(f) = t”, then Df(t) = nt n ~ l 

9.4 Show that we can extend the definition of the formal derivative to K(t) by 
defining 

D{f/g) = {Df.g-f-Dg)/g 2 

when g f 0. Verify the relevant properties of D. 

9.5 Which of the following extensions are normal? 

(a) Q(t ): Q 

(b) Q(v/=5): Q 

(c) Q(a) : Q where a is the real seventh root of 5 
fd) Q(\/5, a): Q(ct), where a is as in (c) 

(e) R(>/=7) : R 

9.6 Show that every extension in C, of degree 2, is normal. Is this true if the degree 
is greater than 2 ? 
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9.7 If E is the splitting field for / over K and K C L C E, show that E is the splitting 
field for / over L. 

9.8* Let / be a polynomial of degree n over K , and let E be the splitting field for 
/ over K. Show that [E : K] divides n\ (Hint: Use induction on n. Consider 
separately the cases when / is reducible or irreducible. Note that albl divides 
(i a + b)l (why?).) 

9.9 Mark the following true or false. 

(a) Every polynomial over Q splits over some subfield of C. 

(b) Splitting fields in C are unique. 

(c) Every finite extension is normal. 

(d) Q(v / 19): Q is a normal extension. 

(e) Q(^l9): Q is a normal extension. 

(f) Q(v'T9) : Q("s/T 9) is a normal extension. 

(g) A normal extension of a normal extension is a normal extension. 



Chapter 10 
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When proving the Fundamental Theorem of Galois theory in Chapter 12, we will 
need to show that if H is a subgroup of the Galois group of a finite normal extension 
L : K, then Ii'' i = H. Here the maps * and f are as defined in Section 8.6. Our method 
will be to show that H and H * are finite groups and have the same order. Since we 
already know that H C // \ the two groups must be equal. This is an archetypal 
application of a counting principle : showing that two finite sets, one contained in the 
other, are identical, by counting how many elements they have, and showing that the 
two numbers are the same. 

It is largely for this reason that we need to restrict attention to finite extensions 
and finite groups. If an infinite set is contained in another of the same cardinality, 
they need not be equal—for example, ZCQ and both sets are countable, but Z ^ Q. 
So counting principles may fail for infinite sets. 

The object of this chapter is to perform part of the calculation of the order of H*. 
Namely, we find the degree [H' : K] in terms of the order of H. In Chapter 11 we 
find the order of H^* in terms of this degree; putting the pieces together will give the 
desired result. 


10.1 Linear Independence of Monomorphisms 

We begin with a theorem of Dedekind, who was the first to make a systematic 
study of field monomorphisms. 

To motivate the theorem and its proof, we consider a special case. Suppose that 
K and L are subfields of C, and let A and t/ be monomorphisms K —► L. We claim 
that A cannot be a constant multiple of ji unless A = ji. By ‘constant’ here we mean 
an element of L. Suppose that there exists a G L such that 

p(x) = aX(x) (10.1) 

for all x £ K. Replace x by yx, where y £ K, to get 

/1 (yx) = aX (yx) 

Since A and fi are monomorphisms, 

=aA(y)A(x) 
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Multiplying (10.1) by A(y), we also have 

A {y)p (. x ) = aX (y)A (x) 

Comparing the two, X (y) = p (y) for all y, so X = p. 

In other words, if X and p are distinct monomorphisms K -7 L, they must be 
linearly independent over L. 

Next, suppose that A|, A 2 , A 3 are three distinct monomorphisms K —y L, and as¬ 
sume that they are linearly dependent over L. That is, 

(/1A 1 — 02 X 2 + g 3 A 3 = 0 


for a ; £ L. In more detail, 

aiAi(x)+a 2 A 2 (x)+a 3 A 3 (x) = 0 ( 10 . 2 ) 

for all x £ K. If some aj = 0 then we reduce to the previous case, so we may assume 
all aj fi 0 . 

Substitute yx for x in (10.2) to get 

a\Xi(yx) + 02 X 2 (yx) + a 3 A 3 (yx) = 0 (10.3) 

That is, 

[aiAi(y)]Ai(x) + [a 2 A 2 (y)]A 2 (x) + [a 3 A 3 (y)]A 3 (x) = 0 (10.4) 

Relations (10.2) and (10.4) are independent—that is, they are not scalar multiples 
of each other—unless Ai(y) = A 2 (y) = A 3 (y), and we can choose y to prevent this, 
therefore we may eliminate one of the Xj to deduce a linear relation between at most 
two of them, contrary to the previous case. Specifically, there exists y £ K such that 
X\ (y) 7 ^ A 3 (y). Multiply ( 10 . 2 ) by A 3 (y) and subtract from (10.4) to get 

[aiAi(y) - fliA 3 (y)]Ai (x) + [ 0^2 (y) - a 2 A 3 (y)]A 2 (x) = 0 

Then the coefficient of X\ (x) is a 1 (Ai (y) — A 3 (y)) 7 ^ 0 , a contradiction. 

Dedekind realised that this approach can be used inductively to prove: 

Lemma 10.1 (Dedekind). If K and L are subfields of C, then every set of distinct 
monomorphisms K —> L is linearly independent over L. 

Proof Let X\,...,X n be distinct monomorphisms K —> L. To say these are linearly 
independent over L is to say that there do not exist elements £ L such that 

aiAi(x)-t-ba„A,,(x )=0 (10.5) 

for all x £ K , unless all the are 0 . 

Assume the contrary, so that (10.5) holds. At least one of the a,- is non-zero. 
Among all the valid equations of the form (10.5) with all a, fi 0, there must be at 
least one for which the number n of non-zero terms is least. Since all A ; are non¬ 
zero, n fi 1. We choose notation so that equation (10.5) is such as expression. Hence 
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we may assume that there does not exist an equation like (10.5) with fewer than n 
terms. From this we deduce a contradiction. 

Since Ai yb A„, there exists y £ K such that Ai ( y ) A„(y). Therefore yf=- 0. Now 

(10.5) holds with yx in place of x, so 

«tAi(yx) H- \-a„X,,(yx) =0 


for all x £ K, whence 


aiAi(y)Ai(x)d - \-a n X n (y)X n (x) = 0 (10.6) 

for all x £ K. Multiply (10.5) by X\ (y) and subtract (10.6), so that the first terms 
cancel: we obtain 

ai [X 2 (x)Ai (y) - A 2 (x) A 2 (y)] d - 1 -a n [A„ (x)Xi (y) - X n (x) X n (y)] = 0 

The coefficient of A„(x) is a n [Ai (y) — A,, (y)] yb 0, so we have an equation of the form 
(10.5) with fewer terms. Deleting any zero terms does not alter this statement. This 
contradicts the italicised assumption above. 

Consequently no equation of the form (10.5) exists, so and the monomorphisms 
are linearly independent. □ 

Example 10.2. Let K = Q(a) where a = £ R. There are three monomorphisms 

K —» C, namely 

Ai (p + qa + ra 2 ) = p + qa + ra 2 
X 2 (p + qa + ra 2 ) = p + qcoa + rara 2 
A 3 (p + qa + ra 2 ) = p + qco 2 a + rcoa 2 

where p,q,r £ Q and to is a primitive cube root of unity. We prove by ‘bare hands’ 
methods that the Xj are linearly independent. Suppose that aiAi(x) + a 2 A 2 (x) + 
u^Xt, (x) = 0 for all x £ K. Set x = 1, a, a 2 respectively to get 

01 +a 2 +a 2 = 0 
a 1 -f- COa 2 T (0~a 2 — 0 
ai + to 2 a 2 + (Oa 2 = 0 

The only solution of this system of linear equations is a\ = a 2 = a 2 = 0. 

For our next result we need two lemmas. The first is a standard theorem of linear 
algebra, which we quote without proof. 

Lemma 10.3. Ifn > m then a system ofm homogeneous linear equations 
a, 1 x 1 H-b a m x n = 0 1 < i < m 

in n unknowns X\,x n , with coefficients a,j in afield K, has a solution in which the 
X, are all in K and are not all zero. 
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This theorem is proved in most first-year undergraduate linear algebra courses, 
and can be found in any text of linear algebra, for example Anton (1987). 

The second lemma states a useful general principle. 

Lemma 10.4. If G is a group whose distinct elements are gi,... ,g n , and if g £ G, 
then as j varies from 1 to n the elements ggj run through the whole ofG, each element 
of G occurring precisely once. 

Proof. If h £ G then g 1 h = g ; for some j and h = ggj. If ggj = ggj then g-, = 
g~ l ggi = g~ l ggj = gj- Thus the map gi > ggi is a bijection G —» G. and the result 
follows. □ 

We also recall some standard notation. We denote the cardinality of a set S by 
|S|. Thus if G is a group, then |G| is the order of G. For example, |S„| = n\ and 
|A„| = n!/2. 

We now come to the main theorem of this chapter, whose proof is similar to that 
of Lemma 10.1, and which can be motivated in a similar manner. 

Theorem 10.5. Let G be a finite subgroup of the group of automorphisms of a field 
K, and let Kq be the fixed field of G. Then [K : Kf\ = |G|. 

Proof. Let n = |G|, and suppose that the elements of G are gi,... ,g n , where gi = 1. 
We prove separately that [K : Kq\ < n and [K : K ( f > n are impossible. 

(1) Suppose that [K : Ag] = m < n. Let {x\,... ,x m } be a basis for K over Kq. By 
Lemma 10.3 there exist y i,..., y„ £ K, not all zero, such that 

yigi (xj) 4 - \-y n gn{xi) = 0 (10.7) 

for i = 1,... ,m. Let x be any element of K. Then 


x = aixi-\ -h a m x m 


where a,\ ,..., a m £ Kq. Hence 


yig\{x) + ■ ■ ■+y n gn{x) = y\g\ 



H- \-yngn 



= - 'ryngn{xi)\ 

l 


= 0 


using (10.7). Hence the distinct monomorphisms gi,...,g n are linearly dependent, 
contrary to Lemma 10.1. Therefore m > n. 

(2) Next, suppose for a contradiction that [/if: Kf > n. Then there exists a set of n +1 
elements of K that are linearly independent over Kq; let such a set be {x\,... ,x n+ i}. 
By Lemma 10.3 there exist y \,... ,y„+i £ K. not all zero, such that for j = 1,... ,n 


y\gj{x\) - Vy n+ igj{x n +i) = 0 


( 10 . 8 ) 
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We subject this equation to a combinatorial attack, similar to that used in proving 
Lemma 10.1. Choose yi,... ,y n +\ so that as few as possible are non-zero, and renum¬ 
ber so that 

y\i • ■ ■ ijr 7^ 0) }V+1) • • ■ ^n+\ = o 

Equation (10.8) now becomes 

H- \-yrgj{x r ) = 0 (10.9) 

Let g £G, and operate on (10.9) with g. This gives a system of equations 

g(yi)ggji x i) 4 - \-g(yr)ggj{xr) = 0 

By Lemma 10.4, as j varies, this system of equations is equivalent to the system 

g(yi)gj{ x i) 4 - *-g(y r )gj(Xr)= 0 ( 10 . 10 ) 

Multiply (10.9) by g(yi) and (10.10) by y\ and subtract, to get 

[y2g(y\) - g{yi)y\]gj(x 2 ) h -b \yrg(yi) - g(y r )yi]gj{x r ) = o 

This is a system of equations like (10.9) but with fewer terms, which gives a contra¬ 
diction unless all the coefficients 


yig{y\)-y\g{yi) 


are zero. If this happens then 


yo'i 1 =g(yiy i 1 ) 

for all g € G, so that y,V| 1 G Kq. Thus there exist zi,..,,z r £ Kq and an element 
k € K such that y,- = kn for all i. Then (10.9), with j = 1, becomes 


At 1 tel H - Yx r kz r = 0 

and since k 0 we may divide by k, which shows that the x, are linearly dependent 
over Kq. This is a contradiction. 

Therefore [K : /Go] is not less than n and not greater than n, so [/if : Kf = n = |G| 
as required. □ 

Corollary 10.6. If G is the Galois group of the finite extension L : K, and H is a finite 
subgroup ofG, then 

[H^:K} = [L:K]/\H\ 

Proof. By the Tower Law, [L:K] = [L: H‘][H' : K], so [// f : K] = [L : K]/[L : H']. 
But this equals [L : K]/\H\ by Theorem 10.5. □ 
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Examples 10.7. We illustrate Theorem 10.5 by two examples, one simple, the other 
more intricate. 

(1) Let G be the group of automorphisms of C consisting of the identity and complex 
conjugation. The fixed field of G is R, for if x— iy = x + iy (x,y £ R) then y = 0, and 
conversely. Hence [C : R] = |G| = 2, a conclusion which is manifestly correct. 

(2) Let K = Q(£) where £ = exp(27n/5) € C. Now £ 5 = 1 and Q(£) consists of all 
elements 

P + q£ + rC 2 + sC 3 +tC 4 ( 10 . 11 ) 

where p,q,r,s,t £ Q. The Galois group of Q(£) : Q is easy to find, for if a is a 
Q-automorphism of Q(£) then 

(a(C )) 5 = «(C 5 ) = «(l) = l, 

so that a(£) = £ 2 , £ 3 , or £ 4 . This gives four candidates for Q-automorphisms: 

O'! : p + q^ + r^ 2 + s £ 3 + f £ 4 p + q£ + r£ 2 + st^ 3 +t£ 4 
CC 2 : p + s^+q^ 2 + t £ 3 +r £ 4 

a 3 : p + rl+t£ 2 + ql 3 +sl 4 

0 : 4 : p + t£+s£ 2 + r£ 3 +q£ 4 

It is easy to check that all of these are Q-automorphisms. The only point to bear 
in mind is that 1, £ 2 , £ 3 , £ 4 are not linearly independent over Q. However, their 

linear relations are generated by just one: £ + £ 2 + £ 3 + £ 4 = — 1 , and this relation 
is preserved by all of the candidate Q-automorphisms. 

Alternatively, observe that £,£ 2 ,£ 3 ,£ 4 all have the same minimal polynomial 
t 4 +t 3 +t 2 +t + 1 and use Corollary 5.13. 

We deduce that the Galois group of Q(£) : Q has order 4. It is easy to find 
the fixed field of this group: it turns out to be Q. Therefore, by Theorem 10.5, 
[Q(0 : Q] = 4. At first sight this might seem wrong, for equation ( 10 . 11 ) expresses 
each element in terms of five basic elements; the degree should be 5. In support of 
this contention, £ is a zero of t 5 — 1. The astute reader will already have seen the 
source of this dilemma: r 5 — 1 is not the minimal polynomial of C over Q, since it 
is reducible. The minimal polynomial is, as we have seen, t 4 +t 3 + t 2 + t+ 1, which 
has degree 4. Equation (10.11) holds, but the elements of the supposed ‘basis’ are 
linearly dependent. Every element of Q(£) can be expressed uniquely in the form 

p + qZ + rf + st; 3 

where p,q,r,s £ Q. We did not use this expression because it lacks symmetry, making 
the computations formless and therefore harder. 


EXERCISES 


10.1 Check Theorem 10.5 for the extension C(fi C(si,... ,s„) of Chapter 8 

Section 8.7. 
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10.2 Find the fixed field of the subgroup {a;, 0 : 4 } for Example 10.7(2). Check that 
Theorem 10.5 holds. 

10.3 Parallel the argument of Example 10.7(2) when £ = e 27r,//7 . 

10.4 Find all monomorphisms Q — > C. 

10.5 Mark the following true or false. 

(a) If S C T is a finite set and |S| = |T|, then S = T. 

(b) The same is true of infinite sets. 

(c) There is only one monomorphism Q —> Q. 

(d) If K and L are subfields of C, then there exists at least one monomorphism 
K^L. 

(e) Distinct automorphisms of a field K are linearly independent over K. 

(f) Linearly independent monomorphisms are distinct. 
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Chapter 11 

Field Automorphisms 


The theme of this chapter is the construction of automorphisms to given speci¬ 
fications. We begin with a generalisation of a /('-automorphism, known as a K- 
monomorphism. For normal extensions we shall use /('-monomorphisms to build up 
/('-automorphisms. Using this technique, we can calculate the order of the Galois 
group of any finite normal extension, which combines with the result of Chapter 10 
to give a crucial part of the fundamental theorem of Chapter 12. 

We also introduce the concept of a normal closure of a finite extension. This 
useful device enables us to steer around some of the technical obstructions caused by 
non-normal extensions. 


11.1 /('-Monomorphisms 

We begin by generalising the concept of a /('-automorphism of a subfield L of C, 
by relaxing the condition that the map should be onto. We continue to require it to be 
one-to-one. 

Definition 11.1. Suppose that K is a subfield of each of the subfields M and L ofC. 
Then a K-monomorphism ofM into L is afield monomorphism 0 : M —» L such that 
0 ( k ) = kfor every k £ K. 

Example 11.2. Suppose that K = Q,M = Q ( a) where a is a real cube root of 2, and 
L = C. We can define a K -monomorphism 0 : M — > L by insisting that 0(a) - coa , 
where ft) = er m ^. In more detail, every element of M is of the form p + qa + ra 2 
where p,q,r£ Q, and 


0 (p + qa + ra 2 ) = p + qcoa + rco 2 a 2 

Since a and coa have the same minimal polynomial, namely f 3 —2, Corollary 5.13 
implies that 0 is a /('-monomorphism. 

There are two other /('-monomorphisms M /, in this case. One is the identity, 
and the other takes a to co 2 a (see Figure 18). 

In general if K C M C L then any /(-automorphism of L restricts to a K- 
monomorphism M —> L. We are particularly interested in when this process can be 
reversed. 
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FIGURE 18: Images of Q-monomorphisms of a = Q(v / 2) : Q. 


Theorem 11.3. Suppose that L : K is a finite normal extension and K C M C L. Let 
X be any K-monomorphism M —>■ L. Then there exists a K-automorphism G ofL such 
that o\m = x. 

Proof. By Theorem 9.9, L is the splitting field over K of some polynomial / over K. 
Hence it is simultaneously the splitting field over M for / and over x(M) for x (/). 
But x\k is the identity, so x (/) = /. We have the diagram 

M -» L 
x l | a 
x (M) -» L 

with G yet to be found. By Theorem 9.6, there is an isomorphism <7 : L —»L such that 
<T | m = X. Therefore a is an automorphism of L, and since g\k = x\k is the identity, 
a is a /^-automorphism of L. □ 

This result can be used to construct /^automorphisms: 

Proposition 11.4. Suppose that L : K is a finite normal extension, and a,fi are zeros 
in L of the irreducible polynomial p over K. Then there exists a K-automorphism G 
ofL such that G(a) = fi. 

Proof By Corollary 5.13 there is an isomorphism x : K(a) —> K(f5) such that x\k 
is the identity and x (a) = fi. By Theorem 11.3, X extends to a /("-automorphism cr 
ofL. □ 


11.2 Normal Closures 

When extensions are not normal, we can try to recover normality by making the 
extensions larger. 

Definition 11.5. Let L be a finite extension of K. A normal closure of L : K is an 
extension N of L such that 
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(1) N : K is normal; 

(2) If L C M C IV and M : K is normal, then M = N. 

Thus N is the smallest extension of L that is normal over K. 

The next theorem assures us of a sufficient supply of normal closures, and shows 
that (working inside C) they are unique. 

Theorem 11.6. If L : K is a finite extension in C, then there exists a unique normal 
closure N CC ofL : K, which is a finite extension of K. 

Proof. Let x\ ,..., x r be a basis for L over K, and let nij be the minimal polynomial of 
Xj over K. Let N be the splitting field for / = m\ni 2 ■ ■ -in,- over L. Then N is also the 
splitting field for / over K, so N : K is normal and finite by Theorem 9.9. Suppose 
that LC P CN where P : K is normal. Each polynomial mj has a zero xj £ l\ so by 
normality / splits in P. Since N is the splitting field for /, we have P = N. Therefore 
N is a normal closure. 

Now suppose that M and N are both normal closures. The above polynomial / 
splits in M and in /V, so each of M and N contain the splitting field for / over K. This 
splitting field contains L and is normal over K, so it must be equal to both M and 
N. □ 

Example 11.7. Consider Q(a):Q where a is the real cube root of 2. This extension 
is not normal, as we have seen. If we let K be the splitting field for f 3 — 2 over Q, 
contained in C, then K = Q(a,am, aco 2 ) where co = (—1 +iV 3)/2 is a complex 
cube root of unity. This is the same as Q(a,co). Now K is the normal closure for 
Q (a): Q. So here we obtain the normal closure by adjoining all the ‘missing’ zeros. 
Normal closures let us place restrictions on the image of a monomorphism. 

Lemma 11.8. Suppose that K C L C N C M where L : K is finite and N is the normal 
closure ofL : K. Let x be any K-monomorphism L —> M. Then x (L) C N. 

Proof. Let a £ L. Let m be the minimal polynomial of a over K. Then m(a) = 0 so 
X (m(a)) =0. But x (m(a)) =m(x(a)) since x is a ICmonomorphism, som(x(a)) = 
0 and x(a) is a zero of m. Therefore x(a) lies in N since N : K is normal. Therefore 
x (L) C N. 

This result often lets us restrict attention to the normal closure of a given ex¬ 
tension when discussing monomorphisms. The next theorem provides a sort of con¬ 
verse. □ 

Theorem 11.9. For a finite extension L : K the following are equivalent: 

(1) L.K is normal. 

(2) There exists a finite normal extension N of K containing L such that every 
K-monomorphism X : L —> N is a K-automorphism ofL. 

(3) For every finite extension M of K containing L, every K-monomorphism X : 
L —> M is a K-automorphism ofL. 
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Proof. We show that (1) => (3) => (2) =>• (1). 

(1) =>• (3). If L : K is normal then L is the normal closure of L . K, so by 
Lemma 11.8, X(L) C L. But T is a /("-linear map defined on the finite-dimensional 
vector space L over K , and is a monomorphism. Therefore t(L) has the same dimen¬ 
sion as L , whence x(L) = L and T is a //-automorphism of L. 

(3) => (2). Let N he the normal closure for L : K. Then N exists by Theorem 11.6, 
and has the requisite properties by (3). 

(2) => (1). Suppose that / is any irreducible polynomial over K with a zero a £ L. 

Then / splits over N by normality, and if (3 is any zero of / in N , then by Proposi¬ 
tion 11.4 there exists an automorphism a of N such that oia) = [5. By hypothesis, 
<7 is a //-automorphism of L, so p = a ( a. ) £ o(L) = L. Therefore / splits over L and 
L.K is normal. □ 

Our next result is of a more computational nature. 

Theorem 11.10. Suppose that L : K is a finite extension of degree n. Then there are 
precisely n distinct K-monomorphisms of L into the normal closure N of L : K, and 
hence into any given normal extension M of K containing L. 

Proof Use induction on [.L : K\. If [L: K} = 1, then the result is clear. Suppose that 
[L:K]=k > 1. Let a G L\K with minimal polynomial m over K. Then 

dm = [//(a): K] = r > 1 

Now m is an irreducible polynomial over a subfield of C with one zero in the normal 
extension N , so m splits in N and its zeros a\,...,a r are distinct. By induction there 
are precisely s distinct AT (a) -monomorphisms pi,... ,p s : L —> N, where s = [L : 
//(a)] = k/r. By Proposition 11.4, there are r distinct //-automorphisms X\, ..., X r of 
N such that xfa) = a,-. The maps 

(/)ij = Xipj (1 <i<r. \ <j<s) 

are /Z-monomorphisms L N. 

We claim they are distinct. Suppose <f)jj = (pi,/. Then X k 1 T, = p/p ; 1 . The p, fix 
K(a), so they map a to itself. But p ; is defined by its action on a, so p/p ; 1 is the 
identity. That is, pi = Pj. So r,. 1 r, is the identity, and Xi- = x\. Therefore i = k,j = l, so 
the P/ j are distinct. They therefore provide rs = k distinct /Z-monomorphisms L N. 

Finally, we show that these are all of the /Z-monomorphisms L^N. Let x : L —> N 
be a /Z-monomorphism. Then x(a) is a zero of m in N, so X(a) = a, for some i. The 
map & = xf l x is a //(a)-monomorphism L —> N, so by induction 0 = pj for some 
Hence x = xiPj = p /; and the theorem is proved. □ 

We can now calculate the order of the Galois group of a finite normal extension, 
a result of fundamental importance. 

Corollary 11.11. If L : K is a finite normal extension inside C, then there are pre¬ 
cisely [L : K] distinct K-automorphisms ofL. That is, 

|F(L : K)\ = [L: K] 
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Proof. Use Theorem 11.10. □ 

From this we easily deduce the important: 

Theorem 11.12. Let L.K be a finite extension with Galois group G. If L : K is 
normal, then K is the fixed field ofG. 

Proof Let Kq be the fixed field of G , and let [L: K] =n. Corollary 11.11 implies that 
|G| = n. By Theorem 10.5, [L : Kq] = n. Since K C Kq we must have K = Kq. □ 

An alternative and in some ways simpler approach to Corollary 11.11 and Theo¬ 
rem 11.12 can be found in Geek (2014). 

There is a converse to Theorem 11.12, which shows why we must consider nor¬ 
mal extensions in order to make the Galois correspondence a bijection. Before we can 
prove the converse, we need a theorem whose statement and proof closely resemble 
those of Theorem 11.10. 

Theorem 11.13. Suppose that K C L C M and M : K is finite. Then the number of 
distinct K-monomorphisms L —» M is at most [L : K\. 

Proof. Let A be a normal closure of M : K. Then the set of k'-monomorphisms L 
M is contained in the set of A-monomorphisms L —> N, and by Theorem 11.10 there 
are precisely [L : K] of those. □ 

Theorem 11.14. If L is any field, G any finite group of automorphisms of L, and K 
is its fixed field, then L : K is finite and normal, with Galois group G. 

Proof. By Theorem 10.5, [L : K\ = |G| = n, say. There are exactly n distinct K- 
monomorphisms L —> L, namely, the elements of the Galois group. 

We prove normality using Theorem 11.9. Thus let N be an extension of K con¬ 
taining L, and let T be a k'-monomorphism L —> N. Since every element of the Ga¬ 
lois group of L.K defines a k'-monomorphism L —> N, the Galois group provides n 
distinct k'-monomorphisms L —» N, and these are automorphisms of L. But by The¬ 
orem 11.13 there are at most n distinct k'-monomorphisms L — t N, so T must be one 
of these monomorphisms. Hence T is an automorphism of L. Finally, L.K is normal 
by Theorem 11.9. □ 

If the Galois correspondence is a bijection, then K must be the fixed field of the 
Galois group of L.K, so by the above L : K must be normal. That these hypotheses 
are also sufficient to make the Galois correspondence bijective (for subfields of C) 
will be proved in Chapter 12. For general fields we need the additional concept of 
‘separability’, see Chapter 17. 


EXERCISES 


11.1 Suppose that L.K is finite. Show that every /("-monomorphism L —> L is an 
automorphism. Does this result hold if the extension is not finite? 
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11.2 Construct the normal closure N for the following extensions: 

(a) Q(a):Q where a is the real fifth root of 3 

(b) Q(j3):Q where /3 is the real seventh root of 2 

(c) Q(x/2,V3):Q 

(d) Q(a, y/2): Q where a is the real cube root of 2 

(e) Q(y):Q where y is a zero of r 3 — 3r 2 + 3 

11.3 Find the Galois groups of the extensions (a), (b), (c), fd) in Exercise 11.2. 

11.4 Find the Galois groups of the extensions IV : Q for their normal closures N. 

11.5 Show that Lemma 11.8 fails if we do not assume that IV : K is normal, but is 
true for any extension IV of L such that N : K is normal, rather than just for a 
normal closure. 

11.6 Use Corollary 11.11 to find the order of the Galois group of the extension 
Q(v/3, \/5, v/7): Q- (Hint: Argue as in Example 6.8.) 

11.7 Mark the following true or false. 

(a) Every /Gmonomorphism is a /('-automorphism. 

(b) Every finite extension has a normal closure. 

(c) If K C L C M and a is a //-automorphism of M, then the restriction o\l 
is a //-automorphism of L. 

(d) An extension having Galois group of order 1 is normal. 

(e) A finite normal extension has finite Galois group. 

(f) Every Galois group is abelian (commutative). 

(g) The Galois correspondence fails to be bijective for non-normal exten¬ 
sions. 

(h) A finite normal extension inside C, of degree n, has Galois group of order 

n. 

(i) The Galois group of a normal extension is cyclic. 



Chapter 12 

The Galois Correspondence 


We are at last in a position to establish the fundamental properties of the Galois 
correspondence between a field extension and its Galois group. Most of the work has 
already been done, and all that remains is to put the pieces together. 


12.1 The Fundamental Theorem of Galois Theory 

Let us recall a few points of notation from Chapter 8. Let L.K be a field extension 
in C with Galois group G, which consists of all /C-automorphisms of L. Let ■'¥ be the 
set of intermediate fields, that is, subfields M such that K C M C L, and let be the 
set of all subgroups H of G. We have defined two maps 

* : & 
f : Sf 

as follows: if M £ then M* is the group of all M-automorphisms of L. If H £ Sf, 
then H' is the fixed field of H. We have observed in (8.4) that the maps * and + 
reverse inclusions. 

Before proceeding to the main theorem, we need a lemma: 

Lemma 12.1. Suppose that L : K is afield extension, M is an intermediate field, and 
T is a K-automorphism ofL. Then %(M)* = . 

Proof. Let M’ = t(M), and take y £ M*,x i £ M 1 . Then x\ = x{x) for some x £ M. 
Compute: 

(TyT _1 )(xi) = ry(x) = z(x) =x\ 

so T M*x~ l C M'*. Similarly C M*, so L M'*, and the lemma is 

proved. □ 

We are now ready to prove the main result: 

Theorem 12.2 (Fundamental Theorem of Galois Theory). IfL : K is a finite nor¬ 
mal field extension inside C, with Galois group G, and if &are defined as 
above, then: 

(1) The Galois group G has order [L : K], 
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(2) The maps * and ' are mutual inverses, and set up an order-reversing one-to- 
one correspondence between & and Sf. 

(3) IfM is an intermediate field, then 

[L: M\ = \M *| [M:K] = \G\/\M*\ 

(4) An intermediate field M is a normal extension ofK if and only ifM* is a normal 
subgroup of G. 

(5) If an intermediate field M is a normal extension ofK, then the Galois group of 
M : K is isomorphic to the quotient group G/M*. 

Proof Part (1) is a restatement of Corollary 11.11. 

For part (2), suppose that M is an intermediate field, and let [L : M] = d. Then 
\M*\ = d by Theorem 10.5. On the other hand, if H is a subgroup of G of order 
d , then [L : II '} = d by Corollary 11.11. Hence the composite operators *f and t* 
preserve [L : M] and \H\ respectively. 

From their definitions, M"' D M and H r * D H. Therefore these inclusions are 
equalities. 

For part (3), again note that L.M is normal. Corollary 11.11 states that [L:M] = 

| M* |, and the other equality follows immediately. 

We now prove part (4). If M : K is normal, let t 6 G. Then t| m is a K- 
monomorphism M —>■ L, so is a /('-automorphism of M by Theorem 11.9. Hence 
T (M) = M. By Lemma 12.1, = M*, so M* is a normal subgroup of G. 

Conversely, suppose that M* is a normal subgroup of G. Let cr be any K- 
monomorphism M —r L. By Theorem 11.3, there is a IT-automorphism T of L such 
that t| m = cr. Now = M* since M* is a normal subgroup of G, so by 

Lemma 12.1, T (M)* =M*. Bypart2 of Theorem 12.2, T (M) =M. Hence cr(M) =M 
and a is a /('-automorphism of M. By Theorem 11.9, M : K is normal. 

Finally we prove part (5). Let G' be the Galois group of M : K. We can define a 
map 0 : G —> G' by 

0(t) = t| m teG 

This is clearly a group homomorphism G —> G', for by Theorem 11.9 t| m is a K- 
automorphism of M. By Theorem 11.3, (!) is onto. The kernel of (!) is obviously M *, 
so by standard group theory 

G! = im(0) ^ G/ker(>) = G/M* 

where im is the image and ker the kernel. □ 

Note how Theorem 10.5 is used in the proof of part (2) of Theorem 12.2: its use 
is crucial. Many of the most beautiful results in mathematics hang by equally slender 
threads. 

Parts (4) and (5) of Theorem 12.2 can be generalized: see Exercise 12.2. Note 
that the proof of part (5) provides an explicit isomorphism between F(M : K) and 
G/M* , namely, restriction to M. 



Exercises 


153 


The importance of the Fundamental Theorem of Galois Theory derives from its 
potential as a tool rather than its intrinsic merit. It enables us to apply group theory to 
otherwise intractable problems about polynomials over C and associated subfields of 
C, and we shall spend most of the remaining chapters exploiting such applications. 


EXERCISES 

12.1 Work out the details of the Galois correspondence for the extension 

Q(i,V5):Q 

whose Galois group is G = {I : R,S, T} as in Chapter 8 . 

12.2 Let L.K be a finite normal extension in C with Galois group G. Suppose that 
M,N are intermediate fields with M C N. Prove that N : M is normal if and 
only if N* is a normal subgroup of M*. In this case prove that the Galois group 
of AC Mis isomorphic to M* /N*. 

12.3* Let 7 = \/2 + V2. Show that Q(y): Q is normal, with cyclic Galois group. 
Show that Q(y,0=Q(jU) where / 1 4 = i. 

12.4* Find the Galois group oft 6 — 1 over Q. 

12.5* Find the Galois group of t 6 — 2f 3 — 1 over Q. 

12.6 Let £ = e ™/ 6 be a primitive 12th root of unity. Find the Galois group r(Q(£): 
Q) as follows. 

(a) Prove that £ is a zero of the polynomial t 4 — t 2 + 1, and that the other 
zeros are £ 5 , £ 7 , £ n . 

(b) Prove that t 4 — t 2 +1 is irreducible over Q, and is the minimal polynomial 
of £ over Q. 

(c) Prove that r(Q(£) : Q) consists of four Q-automorphisms (j>j, defined by 

hi Q = C j j = 1,5,7,11 

fd) Prove that r(Q(£): Q) = Z 2 x Z 2 . 

12.7 Using the subgroup structure of Z 2 x Z 2 as in Exercise 12.6, find all inter¬ 
mediate fields between Q and Q(£). [Hint: Calculate the fixed fields of the 
subgroups.] 

12.8 Mark the following true or false. 
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(a) If L : K is a finite normal extension inside C, then the order of the Galois 
group of L : K is equal to the dimension of L considered as a vector space 
over K. 

(b) If M is any intermediate field of a finite normal extension inside C, then 

= M. 

(c) If M is any intermediate field of a finite normal extension inside C, then 
M* f = M. 

fd) If M is any intermediate field of a finite normal extension L : K inside C, 
then the Galois group of M : K is a subgroup of the Galois group of L.K. 

(e) If M is any intermediate field of a finite normal extension L : K inside C, 
then the Galois group of L : M is a quotient of the Galois group of L.K. 



Chapter 13 

A Worked Example 


The Fundamental Theorem of Galois theory is quite a lot to take in at one go, so it is 
worth spending some time thinking it through. We therefore analyse how the Galois 
correspondence works out on an extended example. 

The extension that we discuss is a favourite with writers on Galois theory, because 
of its archetypal quality. A simpler example would be too small to illustrate the theory 
adequately, and anything more complicated would be unwieldy. The example is the 
Galois group of the splitting field of f 4 — 2 over Q. 

The discussion will be cut into small pieces to make it more easily digestible. 

(1) Let f(t) = r 4 — 2 over Q, and let K be a splitting field for / such that K C C. We 
can factorise / as follows: 

/(f) = (f-§)(f + §)(f-i§)(f + i§) 

where B, = \[2 is real and positive. Therefore K = Q(<*, i). Since K is a splitting field, 
K : Q is finite and normal. We are working in C, so separability is automatic. 

(2) We find the degree of K : Q. By the Tower Law, 

The minimal polynomial of i over Q(<!;) is t 2 + 1, since r + 1 = 0 but i 
So [Q(§,i):Q(§)]=2. 

Now B, is a zero of / over Q, and / is irreducible by Eisenstein’s Criterion, 
Theorem 3.19. Hence / is the minimal polynomial of B, over Q, and [Q(^) : Q] = 4. 
Therefore 

[K: Q] = 2.4 = 8 

(3) We find the elements of the Galois group of K : Q. By a direct check, or by 
Corollary 5.13, there are Q-automorphisms (T, T of K such that 

c(i) = i ct(§) = 

t(/) = -i t(|) = B, 

Products of these yield eight distinct Q-automorphisms of K: 
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A Worked Example 


Automorphism Effect on t, Effect on i 


1 

cr iq i 

a 2 -t, i 

a 3 —it, i 

r % -i 

<7T it, -i 

a 2 x -t, —i 

cj 3 t -it, 


Other products do not give new automorphisms, since a 4 = 1 ,T 2 = 1, TC 7 = 
cr 3 T, x< 7 2 = <J 2 x, Tct 3 = ctt. (The last two relations follows from the first three.) 
Any Q-automorphism of K sends i to some zero of f 2 + 1. so i ±f; similarly t, is 
mapped to ff iff q, or iq. All possible combinations of these (eight in number) 
appear in the above list, so these are precisely the Q-automorphisms of K. 

(4) The abstract structure of the Galois group G can be found. The generator-relation 
presentation 

G=(a, T:C7 4 = T 2 = 1, TC7 = C7 3 t) 

shows that G is the dihedral group of order 8 , which we write as O 4 . (In some books 
the notation ID X is used instead. It depends on what you think is important: the order 
is 8 or there is a normal subgroup Z 4 .) 

The group O 4 has a geometric interpretation as the symmetry group of a square. 
In fact we can label the four vertices of a square with the zeros of r 4 — 2, in such a 
way that the geometric symmetries are precisely the permutations of the zeros that 
occur in the Galois group (Figure 19). 



FIGURE 19: The Galois group O 4 interpreted as the symmetry group of a square. 




157 


(5) It is an easy exercise to find the subgroups of G. If as usual we let Z n denote the 
cyclic group of order n, and x the direct product, then the subgroups are as follows: 


Order 8 : G 

G — O 4 

Order 4: {l,c,cr 2 ,c7 3 } 

S^Z 4 

{LC7 2 ,T,C7 2 T} 

7=Z 2 xZ 2 

{1,C7 2 ,OT,C7 3 t} 

U = Z 2 x Z 2 

Order 2: { 1 , cr 2 } 


{l.T} 

b^z 2 

{1,C7T} 

c^z 2 

{1, C7 2 T> 

d^z 2 

{1, <7 3 t} 

e^z 2 

Order 1 : { 1 } 

7=1 


(6) The inclusion relations between the subgroups of G can be summed up by the 
lattice diagram of Figure 20. In such diagrams, X C Y if there is a sequence of 
upward-sloping lines from X to Y. 


G 



D B A C E 



I 

FIGURE 20: Lattice of subgroups. 


(7) Under the Galois correspondence we obtain the intermediate fields. Since the 
correspondence reverses inclusions, we obtain the lattice diagram in Figure 21. 



FIGURE 21: Lattice of subfields. 
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(8) We now describe the elements of these intermediate fields. There are three ob¬ 
vious subfields of K of degree 2 over Q, namely Q(t), Q(\/2), Q(t\/2). These are 
clearly the fixed fields S'. T ', and U\ respectively. The other fixed fields are less 
obvious. To illustrate a possible approach we shall find C 4 . Any element of K can be 
expressed uniquely in the form 

x = gq ai^ a2^~ Aa?,^~ A a^i T- a$i^ a^i^ T a~ji^~ 

where ao,...,aj gQ. Then 

ot(x) = ao +a\i<^ — a 2 % 2 — — a^i + as(—i)iB, — a(,i{i^) 2 — 

= ao + a^B, —Q 2 ^ 2 — a^i + a\iB, A-a^iB, 2 

The element x is fixed by a T (and hence by C) if and only if 

do = ClQ Cl\ = (2$ Cl2 = — a 2 a 3 = — 

CI4 = — CI4 CI5 = Cl \ Cl = Cl(, CX~i — — CI3 

Therefore c/q and are arbitrary, while 


G2 = 0 = fl4 fli = «5 a$ = —ai 


It follows that 


x — ao + ai (1 + i)B, + a^iB, 2 + 03 (1 — i)^j^ 

= «0 + «i[(l +i)$\ + y[(l ~ ^[(1 +0 l s] 3 

which shows that 

C t =Q((l + i)^) 

Similarly, 

A'=Q(i,V 2) B f =Q(|) 0 + =Q(^) = Q((l — 1 )|) 

It is now easy to verify the inclusion relations specified by the lattice diagram in 
Figure 21. 

(9) It is possible, but tedious, to check by hand that these are the only intermediate 
fields. 

(10) The normal subgroups of G are G, S, T,U,A,I. By the Fundamental Theorem of 
Galois theory, G T ,S 4 ,7’ 4 ,I/' .A .!' should be the only normal extensions of Q that 
are contained in K. Since these are all splitting fields over Q, for the polynomials 
t, f 2 + l, r 2 — 2 , t 2 + 2 , r 4 —f 2 — 2 , f 4 — 2 (respectively), they are normal extensions 
of Q. On the other hand B : Q 1 is not normal, since f 4 — 2 has a zero, namely B ,, in 
B 4 but does not split in B . Similarly C‘. I) '. E ' are not normal extensions of Q. 

(11) According to the Fundamental Theorem of Galois theory, the Galois group 
of A 4 : Q is isomorphic to G/A. Now G/A is isomorphic to Z 2 x Z 2 . We calcu¬ 
late directly the Galois group of A 4 : Q. Since A 4 = Q (i, \/2) there are four Q- 
automorphisms: 
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Automorphism Effect on i 

Effect on \J2 

1 i 

s/2 

a i 

-\/2 

p 

V2 

ap -i 

-y/2 


and since a 2 = P 2 = 1 and ap = Pa, this group is Z2 x Z2 as expected. 

(12) The lattice diagrams for & and do not look the same unless one of them 
is turned upside-down. Hence there does not exist a correspondence like the Galois 
correspondence but preserving inclusion relations. It may seem a little odd at first 
that the Galois correspondence reverses inclusions, but in fact it is entirely natural, 
and quite as useful a property as preservation of inclusions. 

It is in general a difficult problem to compute the Galois group of a given field 
extension, particularly when there is no explicit representation for the elements of 
the large field. See Chapter 22. 


EXERCISES 

13.1 Find the Galois groups of the following extensions: 

(a) Q(\/2,x/5):Q 

(b) Q(a): Q where a = e 2m / 3 . 

(c) K : Q where K is the splitting field over Q for t 4 — 31 2 + 4. 

13.2 Find all subgroups of these Galois groups. 

13.3 Find the corresponding fixed fields. 

13.4 Find all normal subgroups of the above Galois groups. 

13.5 Check that the corresponding extensions are normal. 

13.6 Verify that the Galois groups of these normal extensions are the relevant quo¬ 
tient groups. 

13.7* Consider the Galois group of t 6 — 7 over Q, found in Exercise 12.4. Use the 
Galois correspondence to find all intermediate fields. 

13.8* Consider the Galois group of t 6 — 2r 3 — 1 over Q, found in Exercise 12.5. Use 
the Galois correspondence to find all intermediate fields. 
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13.9 Find the Galois group of r 8 — i over Q(i). 

13.10 Find the Galois group of r 8 +t 4 + 1 over Q(i). 

3.11 Use the Galois group Z2 x Z2 x Z2 of Q(\/2, V3, V$) ■ Q to find all interme¬ 
diate fields. Which of these are normal over Q? 

13.12 Mark the following true or false. 

(a) A 3 x 3 square has exactly 9 distinct symmetries. 

(b) The symmetry group of a square is isomorphic to Zg. 

(c) The symmetry group of a square is isomorphic to Sg. 

fd) The symmetry group of a square is isomorphic to a subgroup of § 8 . 

(e) The group O 4 has 10 distinct subgroups. 

(f) The Galois correspondence preserves inclusion relations. 

(g) The Galois correspondence reverses inclusion relations. 



Chapter 14 

Solubility and Simplicity 


In order to apply the Galois correspondence, in particular to solving equations by 
radicals, we need to have at our fingertips a number of group-theoretic concepts and 
theorems. We have already assumed familiarity with elementary group theory: sub¬ 
groups, normal subgroups, quotient groups, conjugates, permutations (up to cycle 
decomposition): to these we now add the standard isomorphism theorems. The rel¬ 
evant theory, along with most of the material in this chapter, can be found in any 
basic textbook on group theory, for example Fraleigh (1989), Humphreys (1996), or 
Neumann, Stoy, and Thompson (1994). 

We start by defining soluble groups and proving some basic properties. These 
groups are of cardinal importance for the theory of the solution of equations by radi¬ 
cals. Next, we discuss simple groups, the main target being a proof of the simplicity 
of the alternating group of degree 5 or more. We end by proving Cauchy’s Theorem: 
if a prime p divides the order of a finite group, then the group has an element of order 
P- 


14.1 Soluble Groups 

Soluble groups were first defined and studied (though not in the current abstract 
way) by Galois in his work on the solution of equations by radicals. They have since 
proved extremely important in many branches of mathematics. 

In the following definition, and thereafter, the notation H<J G will mean that H is 
a normal subgroup of the group G. Recall that an abelian (or commutative) group is 
one in which gh = hg for all elements g.h. 

Definition 14.1. A group G is soluble (in the US: solvable) if it has a finite series of 
subgroups 

1 =G 0 CGi C ... C G» = G (14.1) 

such that 

(1) G/<G,-+i fori = 0,1. 

(2) G;+i /G, is abelian for i = 0,..., n — 1. 

Condition (14.1) does not imply that G,<lG, since G,<1 G, + i<] G i+ 2 does not imply 
G,< G, + 2 - See Exercise 14.10. 
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Solubility and Simplicity 


Examples 14.2. (1) Every abelian group G is soluble, with series 1< G. 

(2) The symmetric group S 3 of degree 3 is soluble, since it has a cyclic normal sub¬ 
group of order 3 generated by the cycle (123) whose quotient is cyclic of order 2. All 
cyclic groups are abelian. 

(3) The dihedral group IDx of order 8 is soluble. In the notation of Chapter 13, it has 
a normal subgroup S of order 4 whose quotient has order 2, and S is abelian. 

(4) The symmetric group S 4 of degree 4 is soluble, having a series 

I<Y<1A4<1S4 

where A4 is the alternating group of order 12, and V is the Klein four-group, which 
we recall consists of the permutations 1, (12)(34),(13)(24), (14)(23) and hence is a 
direct product of two cyclic groups of order 2. The quotient groups are 

V/1 = V abelian of order 4 
A 4 /V = Z 3 abelian of order 3 
S4/A4 = Z 2 abelian of order 2. 

(5) The symmetric group S5 of degree 5 is not soluble. This follows from 
Lemma 8.11 with a bit of extra work. See Corollary 14.8. 

We recall the following isomorphism theorems: 

Lemma 14.3. Let G. //. and A be groups. 

(1) If H<\G and A QG then HC\A<\A and 

A _HA 
HHA ~ IT 

(2) IfHoG, andH CA<G then H<A,A/H<G/H and 

G/H _ G 
AfH ~ A 

(3) If H<lG and A /H<\ G/H then A < G. 

Parts (1) and (2) are respectively the First and Second Isomorphism Theorems. 
They are the translation into normal subgroup language of two straightforward facts: 
restricting a homomorphism to a subgroup yields a homomorphism, and compos¬ 
ing two homomorphisms yields a homomorphism. See Exercise 14.11. Part (3) is a 
converse to part ( 2 ) and is easy to prove. 

Judicious use of these isomorphism theorems lets us prove that soluble groups 
persist in being soluble even when subjected to quite drastic treatment. 

Theorem 14.4. Let G be a group, H a subgroup of G, and N a normal subgroup of 
G. 
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(1) If G is soluble, then H is soluble. 

(2) If G is soluble, then G/N is soluble. 

(3) If N and G/N are soluble, then G is soluble. 

Proof. (l)Let 

1 = Go<] G\ <l... <l G r = G 

be a series for G with abelian quotients G,-+i/G;. Let H\ = G, fl //. Then H has a 
series 

1 =H 0 <1 ...<lH r = H 
We show the quotients are abelian. Now 

Hj+ 1 _ G, +1 nfl _ G, +1 n H „ Gj(G i+ 1 Off) 

Hi GN U Gj(T (Gj+i HH) G, 

by the first isomorphism theorem. But this latter group is a subgroup of Gi+i/G,- 
which is abelian. Hence H- l+ \ /Hj is abelian for all i, and H is soluble. 

(2) Take G, as before. Then G/N has a series 

N/N = GqN/N< G X N/N< ...< G r N/N = G/N 
A typical quotient is 

Gj+iN/N 

GiN/N 

which by the second isomorphism theorem is isomorphic to 

G, + 1 N _ G i+ i(GjN) ^ Gj + 1 ^ G,- + i/G,- 

GiN GjN _ G/+i n (G,W) " (G/+i n (G,-JV))/G/ 

which is a quotient of the abelian group G;+i/G,-, so is abelian. Therefore G/N is 
soluble. 

(3) There exist two series 

1 = N 0 <Ni<\ ...<N r =N 
N/N = Gq/N<\ Gi /N< ...<G s /N = G/N 

with abelian quotients. Consider the series of G given by combining them: 

1 =A 0 <UVi<l ...<\N r =N = G 0 <\Gi< ...<\G S = G 

The quotients are either Ni+i/Nj (which is abelian) or G,+i/G,-, which is isomorphic 
to 

Gj+i/N 

Gt/N 

and again is abelian. Therefore G is soluble. □ 
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A group G is an extension of a group A by a group B if G has a normal subgroup 
N isomorphic to A such that G/N is isomorphic to B. We may sum up the three 
properties of the above theorem as: the class of soluble groups is closed under taking 
subgroups, quotients, and extensions. The class of abelian groups is closed under 
taking subgroups and quotients, but not extensions. It is largely for this reason that 
Galois was led to define soluble groups. 


14.2 Simple Groups 

We turn to groups that are, in a sense, the opposite of soluble. 

Definition 14.5. A group G is simple if it is nontrivial and its only normal subgroups 
are 1 and G. 

Every cyclic group 7L p of prime order is simple, since it has no subgroups other 
than 1 and Z p , hence in particular no other normal subgroups. These groups are also 
abelian, hence soluble. They are in fact the only soluble simple groups: 

Theorem 14.6. A soluble group is simple if and only if it is cyclic of prime order. 

Proof Since G is soluble group, it has a series 

1 = Go< G i <d ... <| G„ = G 

where by deleting repeats we may assume G/+ 1 / G,. Then G„ . | is a proper normal 
subgroup of G. However, G is simple, so G„_i = 1 and G = G n /G, ,_i, which is 
abelian. Since every subgroup of an abelian group is normal, and every element of G 
generates a cyclic subgroup, G must be cyclic with no non-trivial proper subgroups. 
Hence G has prime order. 

The converse is trivial. □ 

Simple groups play an important role in finite group theory. They are in a sense 
the fundamental units from which all finite groups are made. Indeed the Jordan- 
Holder theorem, which we do not prove, states that every finite group has a series of 
subgroups like (14.1) whose quotients are simple, and these simple groups depend 
only on the group and not on the series chosen. 

We do not need to know much about simple groups, intriguing as they are. We 
require just one result: 

Theorem 14.7. Ifn > 5, then the alternating group A n of degree n is simple. 

Proof. We use much the same strategy as in Lemma 8.11, but we are proving a rather 
stronger property, so we have to work a bit harder. 

Suppose that 1 ^ N<i A„. Our strategy will be as follows: first, observe that if N 
contains a 3-cycle then it contains all 3-cycles, and since the 3-cycles generate A„, 
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we must have N = A„. Second, prove that N must contain a 3-cycle. It is here that 
we need n> 5. 

Suppose then, that N contains a 3-cycle; without loss of generality N contains 
(123). Now for any k > 3 the cycle (32 k) is an even permutation, so lies in A„, and 
therefore 

(32k) (123) (32k)- 1 = (lk2) 

lies in N. Hence N contains (lk2) 2 = (12 k) for all k > 3. We claim that A„ is gener¬ 
ated by all 3-cycles of the form (12 k). If/; = 3 then we are done. If n > 3 then for all 
a,b > 2 the permutation (\a)(2b) is even, so lies in A„, and then A„ contains 

(la)(2b)(12k)((la)(2b))~ 1 = (abk) 

if k^a,b. Since A„ is generated by all 3-cycles (Exercise 8.7), it follows that IV = A„. 

It remains to show that N must contain at least one 3-cycle. We do this by an 
analysis into cases. 

(1) Suppose that N contains an element x = abc..., where a.b.c,... are disjoint 
cycles and 

a = (a i .. .a m ) (m > 4) 

Lett = (aiaiciT,). Then N contains t~ 1 xt. Since t commutes with b,c,... (disjointness 
of cycles) it follows that 


t 1 xt = (t 1 at)bc ...=z (say) 


so that N contains 


zx 1 = (aia^a,,,) 


which is a 3-cycle. 

(2) Now suppose N contains an element involving at least two 3-cycles. Without loss 
of generality N contains 

x= (123)(456)y 

where y is a permutation fixing 1, 2, 3, 4, 5, 6. Let t = (234). Then N contains 


(t x xt)x 1 = (12436) 


Then by case (1) N contains a 3-cycle. 

(3) Now suppose that N contains an element x of the form (i jk)p , where p is a product 
of 2-cycles disjoint from each other and from (ijk). Then N contains x 2 = ( ikj ), 
which is a 3-cycle. 

(4) There remains the case when every element of A is a product of disjoint 2-cycles. 
(This actually occurs when n = 4, giving the four-group Y.) But as n > 5, we can 
assume that N contains 

x=(12)(34)p 

where p fixes 1, 2, 3, 4. If we let 1 = (234) then N contains 

(t- l xt)x~ 1 = (14) (23) 
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and if u = (145)1V contains 

u~ l (t~ 1 xtx~ l )u = (45)(23) 

so that N contains 

(45)(23)(14)(23) = (145) 

contradicting the assumption that every element of IV is a product of disjoint 2-cycles. 
Hence A„ is simple if n > 5 . □ 

In fact A 5 is the smallest non-abelian simple group. This result is often attributed 
to Galois, but Neumann (2011), in his translation of Galois’s mathematical writings, 
points out on pages 384-385 that alternating groups are not mentioned in any sig¬ 
nificant work by Galois, and that the methods available to him were inadequate to 
eliminate various orders for a potential simple group, such as 56. Although it seems 
plausible that Galois knew that A„ is simple for n > 5, there is no clear evidence 
that he did. Indeed, his proof that the quintic cannot be solved by radicals uses other 
special features of the Galois group of an equation of prime degree: see Neumann 
(2011) chapter IV. We discuss this point further in Chapter 25. 

From this theorem we deduce: 

Corollary 14.8. The symmetric group §„ of degree n is not soluble ifn > 5. 

Proof. If S„ were soluble then A„ would be soluble by Theorem 14.4, and simple by 
Theorem 14.7, hence of prime order by Theorem 14.6. But | A„ | = 5 (n !) is not prime 
ifn >5. " □ 


14.3 Cauchy’s Theorem 

We next prove Cauchy’s Theorem: if a prime p divides the order of a finite group, 
then the group has an element of order p. We begin by recalling several ideas from 
group theory. 

Definition 14.9. Elements a and b of a group G are conjugate in G if there exists 
g € G such that a = g~ l bg. 

Conjugacy is an equivalence relation; the equivalence classes are the conjugacy 
classes of G. 

If the conjugacy classes of G are Ci,... ,C r , then one of them, say Ci, contains 
only the identity element of G. Therefore \C\ \ = 1. Since the conjugacy classes form 
a partition of G we have 

|G| = l + |C 2 |+--- + |C r | (14.2) 


which is the class equation for G. 
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Definition 14.10. If G is a group and xgG, then the centraliser Cg{x) of x in G is 
the set of all g £ G for which xg = gx. It is always a subgroup of G. 

There is a useful connection between centralisers and conjugacy classes. 

Lemma 14.11. IfG is a group and x G G, then the number of elements in the conju¬ 
gacy class of x is the index ofCc{x ) in G. 

Proof. The equation g 1 xg = h 1 xh holds if and only if lig 1 x = xhg 1 , which 
means that hg 1 G Cj; (x), that is,h and g lie in the same coset of Cg{x ) in G. The 
number of these cosets is the index of Cg(x) in G, so the lemma is proved. □ 

Corollary 14.12. The number of elements in any conjugacy class of a finite group G 
divides the order of G. 

Definition 14.13. The centre Z(G) of a group G is the set of all elements x G G such 
that xg = gx for all g G G. 

The centre of G is a normal subgroup of G. Many groups have trivial centre, for 
example Z(§ 3 ) = 1. Abelian groups go to the other extreme and have Z(G) = G. 

Lemma 14.14. If A is a finite abelian group whose order is divisible by a prime p, 
then A has an element of order p. 

Proof. Use induction on ,41. If 4 is prime the result follows. Otherwise take a 
proper subgroup M of A whose order m is maximal. If p divides m we are home 
by induction, so we may assume that p does not divide m. Let b be in A but not in 
M, and let B be the cyclic subgroup generated by b. Then MB is a subgroup of A, 
larger than M, so by maximality A = MB. From the First Isomorphism Theorem, 
Lemma 14.3(1), 

\MB\ = \M\\B\/\MDB\ 

so p divides the order r of B. Since B is cyclic, the element b r,r has order p. □ 

From this result we can derive a more general theorem of Cauchy in which the 
group need not be abelian: 

Theorem 14.15 (Cauchy’s Theorem). If a prime p divides the order of a finite 
group G, then G has an element of order p. 

Proof. We prove the theorem by induction on the order |G|. The first few cases |G| = 
1,2,3 are obvious. For the induction step, start with the class equation 

|G| = 1 + |C2| H-1- \C r \ 

Since p||G|, we must have p\\Cf for some j > 2. If x G Cj it follows that p\\Cg(x)\, 
since \Cj\ = \G\/\C g {x)\. 

If Cg{x ) f G then by induction Cj;(x) contains an element of order p, and this 
element also belongs to G. 
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Otherwise Cg(x) = G, which implies that x £ Z(G), and by choice 1 / 1, so 
2(G) ^ 1. 

Either p||Z(G)| or p\\Z(G)\. In the first case the proof reduces to the abelian 
case. Lemma 14.14. In the second case, by induction there exists x £ G such that 
the image x £ G/Z(G) has order p. That is, x p £ Z(G) but x Z(G). Let X be the 
cyclic group generated by x. Now XZ(G ) is abelian and has order divisible by p, so 
by Lemma 14.14 it has an element of order p, and again this element also belongs to 
G. 

This completes the induction step, and with it the proof. □ 

Cauchy’s Theorem does not work for composite divisors of |G|. See Exercise 
14.6. 


EXERCISES 

14.1 Show that the general dihedral group 

D„ = ( a,b : a" = b 2 = 1 ,b~ l ab = a~ l ) 

is a soluble group. Here a.b are generators and the equalities are relations 
between them. 

14.2 Prove that S„ is not soluble for n > 5, using only the simplicity of A5. 

14.3 Prove that a normal subgroup of a group is a union of conjugacy classes. Lind 
the conjugacy classes of A5, using the cycle type of the permutations, and 
hence show that A5 is simple. 

14.4 Prove that S„ is generated by the 2-cycles (12),..., (In). 

14.5 If the point a £ C is constructible by ruler and compasses, show that the Galois 
group of Q(a): Q is soluble. 

14.6 Show that A 5 has no subgroup of order 15, even though 15 divides its order. 

14.7 Show that S„ has trivial centre if n > 3. 

14.8 Lind the conjugacy classes of the dihedral group O,, defined in Exercise 14.1. 
Work out the centralisers of selected elements, one from each conjugacy class, 
and check Lemma 13.7. 

14.9 If G is a group and x,g £ G, show that Co{g^ 1 xg) = g^ l Cc(x)g. 

14.10 Show that the relation ‘normal subgroup of’ is not transitive. (Hint: Consider 
the subgroup G C Y C S 4 generated by the element (12)(34).) 
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14.11 There are (at least) two distinct ways to think about a group homomorphism. 
One is the definition as a structure-preserving mapping, the other is in terms 
of a quotient group by a normal subgroup. The relation between these is as 
follows. If 0 : G —> H is a homomorphism then 

ker(0)<G and G/ker(<j>) = im(<j>) 

If N<1 G then there is a natural surjective homomorphism 

(j):G-^G/N with ker(0) = IV 

Show that the first and second isomorphism theorems are the translations into 
‘quotient group’ language of two facts that are trivial in ‘structure-preserving 
mapping’ language: 

(1) The restriction of a homomorphism to a subgroup is a homomorphism. 

(2) The composition of two homomorphisms is a homomorphism. 

14.12* By counting the sizes of conjugacy classes, prove that the group of rotational 
symmetries of a regular icosahedron is simple. Show that it is isomorphic to 

A 5 . 

14.13 Mark the following true or false. 

(a) The direct product of two soluble groups is soluble. 

(b) Every simple soluble group is cyclic. 

(c) Every cyclic group is simple. 

(d) The symmetric group S n is simple if n > 5. 

(e) Every conjugacy class of a group G is a subgroup of G. 
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Chapter 15 

Solution by Radicals 


The historical aspects of the problem of solving polynomial equations by radicals 
have been discussed in the introduction. Early in his career, Galois briefly thought 
that he had solved the quintic equation by radicals. Figure 22. However, he found 
a mistake when it was suggested that he should try some numerical examples. This 
motivated his work on solubility by radicals. 

The object of this chapter is to use the Galois correspondence to derive a con¬ 
dition that must be satisfied by any polynomial equation that is soluble by radicals, 
namely: the associated Galois group must be a soluble group. We then constmct a 
quintic polynomial equation whose Galois group is not soluble, namely the disarm¬ 
ingly straightforward-looking t 5 — 6t + 3 = 0, which shows that the quintic equation 
cannot be solved by radicals. 

Solubility of the Galois group is also a sufficient condition for an equation to be 
soluble by radicals, but we defer this result to Chapter 18. 


15.1 Radical Extensions 

Some care is needed in formalising the idea of ‘solubility by radicals’. We begin 
from the point of view of field extensions. 

Informally, a radical extension is obtained by a sequence of adjunctions of nth 
roots, for various n. For example, the following expression is radical: 



To find an extension of Q that contains this element we may adjoin in turn elements 
a = \/11 p = V3 7 = y/(7 + j3)/2 8 = e= ^1 + 5 

Recall Definition 8.12, which formalises the idea of a radical extension: L.K is 
radical if L = K(a \,..., a w ) where for each j = 1 there exists nj such that 

a"/ €K(a u ...,(Xj-i) (j > 1) 

The elements a, form a radical sequence for L : K, and the radical degree of a, is 
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FIGURE 22: Galois thought he had solved the quintic... but changed his mind. 


For example, the expression (15.1) is contained in a radical extension of the form 
Q(a, P,y,S,e) of Q, where a 3 = 11 , p 2 = 3, y 5 = (7 + j3)/2, 5 3 = 4, e 4 = 1+5. 

It is clear that any radical expression, in the sense of the introduction, is contained 
in some radical extension. 

A polynomial should be considered soluble by radicals provided all of its zeros 
are radical expressions over the ground field. 

Definition 15.1. Let f be a polynomial over a subfield K of C, and let E be the 
splitting field for f over K. We say that f is soluble by radicals if there exists afield 
M containing E such that M : K is a radical extension. 

We emphasise that in the definition, we do not require the splitting field exten¬ 
sion E : K to be radical. There is a good reason for this. We want everything in the 
splitting field E to be expressible by radicals, but it is pointless to expect everything 
expressible by the same radicals to be inside the splitting field. Indeed.if M : K is 
radical and L is an intermediate field, then L : K need not be radical: see Exercise 
15.6. 

Note also that we require all zeros of / to be expressible by radicals. It is possible 
for some zeros to be expressible by radicals, while others are not—simply take a 
product of two polynomials, one soluble by radicals and one not. However, if an 
irreducible polynomial / has one zero expressible by radicals, then all the zeros 
must be so expressible, by a simple argument based on Corollary 5.13. 

The main theorem of this chapter is: 
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Theorem 15.2. If K is a subfield ofC and K C L C M C C where M : K is a radical 
extension, then the Galois group ofL : K is soluble. 

The otherwise curious word ‘soluble’ for groups arises in this context: a soluble 
(by radicals) polynomial has a soluble Galois group (of its splitting field over the 
base field). 

The proof of this result is not entirely straightforward, and we must spend some 
time on preliminaries. 

Lemma 15.3. If L : K is a radical extension in C and M is the normal closure of 
L : K, then M : K is radical. 

Proof. Let L = K (cti,..., a r ) with a"' G K (ai,..., a,-_ i). Let f be the minimal poly¬ 
nomial of a, over K. Then M I) Lis clearly the splitting field of flji fi- F°r every 
zero fijj of f in Af there exists an isomorphism cr: K(af) —> K(fjjj) by Corollary 5.13. 
By Proposition 11.4, cr extends to a /("-automorphism T : M —> M. Since a, is a mem¬ 
ber of a radical sequence for a subfield of M, so is /j, ; . By combining the sequences, 
we get a radical sequence for M. □ 

The next two lemmas show that certain Galois groups are abelian. 

Lemma 15.4. Let K be a subfield ofC, and let L be the splitting field for t p — 1 over 
K, where p is prime. Then the Galois group ofL : K is abelian. 

Proof. The derivative of t p — 1 is pt p ~ l , which is prime to t p — 1, so by Lemma 9.13 
the polynomial has no multiple zeros in L. Clearly its zeros form a group under 
multiplication; this group has prime order p since the zeros are distinct, so is cyclic. 
Let e be a generator of this group. Then L = Kie) so that any /^-automorphism of 
L is determined by its effect on e. Further, /^-automorphisms permute the zeros of 
t p — 1. Hence any /("-automorphism of L is of the form 

cij : e e j 

and is uniquely determined by this condition. 

But then a,a./ and a ; a, both map e to e' / , so the Galois group is abelian. □ 

It is possible to determine the precise structure of the above Galois group, and 
to remove the condition that p be prime. However, this needs extra work and is not 
needed at this stage. See Theorem 21.9. 

Lemma 15.5. Let K be a subfield ofC in which t n — 1 splits. Let a G K, and let L be 
a splitting field for t n — a over K. Then the Galois group ofL : K is abelian. 

Proof. Let a be any zero of t n — a. Since t" — 1 splits in K. the general zero of t" — a 
is ea where e is a zero of t n — 1 in K. Since L = K(a), any /^-automorphism of L is 
determined by its effect on a. Given two /("-automorphisms 


0 : a i-5- ea i/t:aGt]a 
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where e and T] € K are zeros of t n — 1, then 

//(a) = erja = r\ea = t/0(a) 

As before, the Galois group is abelian. □ 

The main work in proving Theorem 15.2 is done in the next lemma. 

Lemma 15.6. If K is a subfield ofC and L : K is normal and radical, then T(L : K) 
is soluble. 

Proof. Suppose that L = K(ai,...,a n ) with a" J G K(ai,...,(Xj-i). By Proposi¬ 
tion 8.9 we may assume that nj is prime for all j. In particular there is a prime p 
such that af g K. 

We prove the result by induction on n, using the additional hypothesis that all nj 
are prime. The case n = 0 is trivial, which gets the induction started. 

If a\ £ K. then L = K(o. 2 ,... ,a n ) and F(L : K) is soluble by induction. 

We may therefore assume that a i / K. Let / be the minimal polynomial of ai 
over K. Since L.K is normal, / splits in L; since K C C, / has no repeated zeros. 
Since ct\ / K. the degree of / is at least 2. Let /j he a zero of / different from (X \, and 
put e = «i //3. Then e p = 1 and fc'/1. Thus e has order p in the multiplicative group 
of L. so the elements 1, e, e 2 ,..., e 1 ’ 1 are distinct pth roots of unity in L. Therefore 
t p — 1 splits in L. 

Let M C L be the splitting field for t p — 1 over K. that is, let M = A/e). Consider 
the chain of subfields K C M C M ( a \) C L. The strategy of the remainder of the 
proof is illustrated in the following diagram: 

L 

r(L : M(a\ )) soluble by induction 

M(aj) 

r(M(ai) : M) abelian by Lemma 15.5 
M 

r (M : K) abelian by Lemma 15.4 
K 

Observe that L.K is finite and normal, hence so is L : M. therefore Theorem 12.2 
applies to L.K and to L : M. 

Since t p — 1 splits in M and a p € M. the proof of Lemma 15.5 implies that M{(t \) 
is a splitting field for t p — af over M. Thus M (a|): M is normal, and by Lemma 15.5 
r(M(ai) : M) is abelian. Apply Theorem 12.2 to L : M to deduce that 

T(M(ai): M) = T(L : M)/T{L : M(a\)) 
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Now 

L = M{a{){ 0C2,...,a n ) 

so that L : M(a i) is a normal radical extension. By induction Y(L : M(a\)) is soluble. 
Hence by Theorem 14.4(3), V(L : M) is soluble. 

Since M is the splitting field for t p — 1 over K , the extension M : K is normal. By 
Lemma 15.4, T{M : K) is abelian. Theorem 12.2 applied to L.K yields 

r(M : K) ~ r(L : K) /T(L : M) 

Now Theorem 14.4(3) shows that T(L\K) is soluble, completing the induction step. 

□ 


We can now complete the proof of the main result: 

Proof of Theorem 15.2. Let Kq be the fixed field of Y(L : K), and let N : M be the 
normal closure of M : Kq. Then 

KCKqCLCMCN 

Since M : Kq is radical. Lemma 15.3 implies that N: Kq is a normal radical extension. 
By Lemma 15.6, F(A^: ATo) is soluble. 

By Theorem 11.14, the extension L : Kq is normal. By Theorem 12.2 

r(L : K 0 ) S* r (N : K 0 )/r(N : L) 

Theorem 14.4(2) implies that T(L : Kq) is soluble. But T(L : K) = T(L : Kq), so 
r(L : K) is soluble. □ 

The idea of this proof is simple: a radical extension is a series of extensions by nth 
roots; such extensions have abelian Galois groups; so the Galois group of a radical 
extension is made up by fitting together a sequence of abelian groups. Unfortunately 
there are technical problems in carrying out the proof; we need to throw in roots of 
unity, and we have to make various extensions normal before the Galois correspon¬ 
dence can be used. These obstacles are similar to those encountered by Abel and 
overcome by his Theorem on Natural Irrationalities in Section 8.8. 

Now we translate back from fields to polynomials, and in doing so revert to Ga¬ 
lois’s original viewpoint. 

Definition 15.7. Let / be a polynomial over a subfield K of C, with splitting field E 
over K. The Galois group of / over K is the Galois group T(E : K). 

Let G be the Galois group of a polynomial / over K and let df = n. If a G E is a 
zero of /, then f(a) =0, so for any g £ G 

/(#(«)) =#(/(«)) =o 

Hence each element g £ G induces a permutation g' of the set of zeros of / in E. 
Distinct elements of G induce distinct permutations, since E is generated by the zeros 
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of /. It follows easily that the map g i-> g' is a group monomorphism of G into the 
group S n of all permutations of the zeros of /. In other words, we can think of G as 
a group of permutations on the zeros of /. This, in effect, was how Galois thought 
of the Galois group, and for many years afterwards the only groups considered by 
mathematicians were permutation groups and groups of transformations of variables. 
Arthur Cayley was the first to propose a definition for an abstract group, although it 
seems that the earliest satisfactory axiom system for groups was given by Leopold 
Kroneckerin 1870 (Huntingdon 1905). 

We may restate Theorem 15.2 as: 

Theorem 15.8. Let f be a polynomial over a subfield K ofC. Iff is soluble by radi¬ 
cals, then the Galois group of f over K is soluble. 

The converse also holds: see Theorem 18.21. 

Thus to find a polynomial not soluble by radicals it suffices to find one whose 
Galois group is not soluble. There are two main ways of doing this. One is to look at 
the general polynomial of degree n, which we introduced in Chapter 8 Section 8.7, 
but this approach has the disadvantage that it does not show that there are specific 
polynomials with rational coefficients that are insoluble by radicals. The alternative 
approach, which we now pursue, is to exhibit a specific polynomial with rational 
coefficients whose Galois group is not soluble. Since Galois groups are hard to cal¬ 
culate, a little low cunning is necessary, together with some knowledge of the sym¬ 
metric group. 


15.2 An Insoluble Quintic 

Watch carefully; there is nothing up my sleeve... 

Lemma 15.9. Let p be a prime, and let f be an irreducible polynomial of degree p 
over Q. Suppose that f has precisely two non-real zeros in C. Then the Galois group 
off over Q is isomorphic to the symmetric group § p . 

Proof By the Fundamental Theorem of Algebra, Theorem 2.4, C contains the split¬ 
ting field E of /. Let G be the Galois group of / over Q, considered as a permutation 
group on the zeros of /. These are distinct by Proposition 9.14, so G is (isomorphic 
to) a subgroup of S p . When we construct the splitting field of / we first adjoin an 
element of degree p, so [E : Q] is divisible by p. By Theorem 12.2(1), p divides the 
order of G. By Cauchy’s Theorem 14.15, G has an element of order p. But the only 
elements of S n having order p are the p-cycles. Therefore G contains a p-cycle. 

Complex conjugation is a Q-automorphism of C, and therefore induces a Q-auto- 
morphism of E. This leaves the p — 2 real zeros of / fixed, while transposing the two 
non-real zeros. Therefore G contains a 2-cycle. 

By choice of notation for the zeros, and if necessary taking a power of the p- 
cycle, we may assume that G contains the 2-cycle (12) and the p-cycle (12.. .p). We 
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FIGURE 23: A quintic with three real zeros. 


claim that these generate the whole of § p , which will complete the proof. To prove the 
claim, let c = (12 ...p),t = (12), and let G be the group generated by c and t. Then 
G contains c~ l tc = (23), hence c _1 (23)c = (34),... and hence all transpositions 
(, m,m +1). Then G contains 

(12) (23) (12) = (13) (13) (34) (13) = (14) 

and so on, and therefore contains all transpositions (1 m). Finally, G contains all 
products (l;H)(lr)(l»i) = (mr) with 1 <m< r. But every element of §„ is a product 
of transpositions, so G = S p . □ 

We can now exhibit a specific quintic polynomial over Q that is not soluble by 
radicals. 

Theorem 15.10. The polynomial t 5 — 6t + 3 over Q is not soluble by radicals. 

Proof. Let f{t) = t 5 — 6t + 3. By Eisenstein’s Criterion, / is irreducible over Q. We 
shall show that / has precisely three real zeros, each with multiplicity 1, and hence 
has two non-real zeros. Since 5 is prime, by Lemma 15.9 the Galois group of / over 
Q is S5. By Corollary 14.8, S5 is not soluble. By Theorem 15.8, f(t) = 0 is not 
soluble by radicals. 

It remains to show that / has exactly three real zeros, each of multiplicity 1. Now 
/(—2) = -17, /(-1) = 8, /(0) = 3, /(1) = -2, and /(2) = 23. A rough sketch of 
the graph of y = f(x) looks like Figure 23. This certainly appears to give only three 
real zeros, but we must be rigorous. By Rolle’s theorem, the zeros of / are separated 
by zeros of Of. Moreover, Df = 5t 4 — 6, which has two zeros at ± -y/6/5. Clearly / 
and Df are coprime, so / has no repeated zeros (this also follows by irreducibility) 
so / has at most three real zeros. But certainly / has at least three real zeros, since 
a continuous function defined on the real line cannot change sign except by passing 
through 0. Therefore / has precisely three real zeros, and the result follows. □ 
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15.3 Other Methods 

Of course this is not the end of the story. There are more ways of killing a quin- 
tic than choking it with radicals. Having established the inadequacy of radicals for 
solving the problem, it is natural to look further afield. 

First, some quintics are soluble by radicals. See Chapter 1 Section 1.4 and 
Berndt, Spearman and Williams (2002). What of the others, though? 

On a mundane level, numerical methods can be used to find the zeros (real or 
complex) to any required degree of accuracy. In 1303 (see Joseph 2000) the Chi¬ 
nese mathematician Zhu Shijie wrote about what was later called Horner’s method 
in the West; there it was long credited to the otherwise unremarkable William George 
Horner, who discovered it in 1819. For hand calculations it is a useful practical 
method, but there are many others. The mathematical theory of such numerical meth¬ 
ods can be far from mundane—but from the algebraic point of view it is unilluminat¬ 
ing. 

Another way of solving the problem is to say, in effect, ‘What’s so special about 
radicals?’ Suppose for any real number a we define the ultraradical of a to be the 
real zero of t 5 + t — a. It was shown by G.B. Jerrard (see Kollros 1949, p. 19) that 
the quintic equation can be solved by the use of radicals and ultraradicals. See King 
(1996). 

Instead of inventing new tools we can refashion existing ones. Charles Hermite 
made the remarkable discovery that the quintic equation can be solved in terms of 
‘elliptic modular functions’, special functions of classical mathematics which arose 
in a quite different context, the integration of algebraic functions. The method is 
analogous to the trigonometric solution of the cubic equation. Exercise 1.8. In a tri¬ 
umph of mathematical unification, Klein (1913) succeeded in connecting together 
the quintic equation, elliptic functions, and the rotation group of the regular icosahe¬ 
dron. The latter is isomorphic to the alternating group A 5 , which we have seen plays 
a key part in the theory of the quintic. Klein’s work helped to explain the unexpected 
appearance of elliptic functions in the theory of polynomial equations; these ideas 
were subsequently generalised by Henri Poincare to cover polynomials of arbitrary 
degree. 
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EXERCISES 

15.1 Find radical extensions of Q containing the following elements of C, by ex¬ 
hibiting suitable radical sequences (See Definition 8.12): 

(a) {s/n-y/73)/s/5 

(b) (V 6 + 2^5 ) 4 

(c) (2\/5—A)/\J 1 + y/99 

15.2 What is the Galois group of t p — 1 over Q for prime pi 

15.3 Show that the polynomials f 5 — At+ 2, f 5 — At 2 +2, f 5 — 6 f 2 + 3, and t 1 — 10f 5 + 
15f + 5 over Q are not soluble by radicals. 

15.4 Solve the sextic equation 

f 6 -f 5 +f 4 -f 3 +f 2 -f + 1=0 

satisfied by a primitive 14th root of unity, in terms of radicals (Hint: Put u = 

t + 1/f.) 

15.5 Solve the sextic equation 

f 6 + 2f 5 -5f 4 + 9f 3 -5f 2 +2f + l =0 
by radicals (Hint: Put u = t + 1 /t .) 

15.6* If L : K is a radical extension in C and M is an intermediate field, show that 
M : K need not be radical. 

15.7 If p is an irreducible polynomial over K C C and at least one zero of p is 
expressible by radicals, prove that every zero of p is expressible by radicals. 

15.8* If K C C and a 2 = a € K, [3 2 = b £ K, and none of a, b , ab are squares in K , 
prove that K(a, j3): K has Galois group Z 2 x Z 2 . 

15.9* Show that if N is an integer such that /V| > 1, and p is prime, then the quintic 
equation 

x 5 —Npx + p = 0 

cannot be solved by radicals. 

15.10* Suppose that a quintic equation /(f) = 0 over Q is irreducible, and has one 
real root and two complex conjugate pairs. Does an argument similar to that of 
Lemma 15.9 prove that the Galois group contains A 5 ? If so, why? If not, why 
not? 

15.11 Prove the Theorem on Natural Irrationalities using the Galois correspondence. 
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15.12 Mark the following true or false. 

(a) Every quartic equation over a subfield of C can be solved by radicals. 

(b) Every radical extension is finite. 

(c) Every finite extension is radical. 

fd) The order of the Galois group of a polynomial of degree n divides n\ 

(e) Any reducible quintic polynomial can be solved by radicals. 

(f) There exist quartics with Galois group §4. 

(g) An irreducible polynomial of degree 11 with exactly two non-real zeros 
has Galois group §n. 

(h) The normal closure of a radical extension is radical. 

(i) A 5 has 50 elements. 



Chapter 16 

Abstract Rings and Fields 


Having seen how Galois Theory works in the context assumed by its inventor, we 
can generalise everything to a much broader context. Instead of subfields of C, we 
can consider arbitrary fields. This step goes back to Weber in 1895, but first achieved 
prominence in the work of Emil Artin in lectures of 1926, later published as Artin 
(1948). With the increased generality, new phenomena arise, and these must be dealt 
with. 

One such phenomenon relates to the Fundamental Theorem of Algebra, which 
does not hold in an arbitrary field. We could get round this by constructing an ana¬ 
logue, the ‘algebraic closure’ of a field, in which every polynomial splits into linear 
factors. However, the machinery needed to prove the existence of an algebraic clo¬ 
sure is powerful enough to make the concept of an algebraic closure irrelevant any¬ 
way. So we concentrate on developing that machinery, which centres on the abstract 
properties of field extensions, especially finite ones. 

A more significant problem is that a general field K need not contain Q as a sub¬ 
field. The reason is that sums 1 + 1-1-hi can behave in novel ways. In particular, 

such a sum may be zero. If it is, then the smallest number of Is involved must be a 
prime p, and K contains a subfield isomorphic to Z p , the integers modulo p. Such 
fields are said to have ‘characteristic’ p, and they introduce significant complications 
into the theory. The most important complication is that irreducible polynomials need 
not be separable; that is, they may have multiple zeros. Separability is automatic for 
subfields of C, so it has not been seen to play a major role up to this point. However, 
behind the scenes it has been one of the two significant constraints that make Galois 
theory work, the other being normality. From now on, separability has to be taken a 
lot more seriously, and it has a substantial effect. 

Rethinking the old results in the new context provides good revision and rein¬ 
forcement, and it explains where the general concepts come from. Nonetheless, if 
you seriously work through the material and do not just accept that everything works, 
you will come to appreciate that Bourbaki had a point. 


16.1 Rings and Fields 

Today’s concepts of ‘ring’ and ‘field’ are the brainchildren of Dedekind, who 
introduced them as a way of systematising algebraic number theory; their influence 
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then spread as was reinforced by the growth of abstract algebra under the influence of 
Weber, Hilbert, Emmy Noether, and Bartel Leenert van der Waerden. These concepts 
are motivated by the observation that the classical number systems Z, Q, R, and C 
enjoy a long list of useful algebraic properties. Specifically, Z is a ‘ring’ and the 
others are ‘fields’. 

The formal definition of a ring is: 

Definition 16.1. a ring R is a set, equipped with two operations of addition (denoted 
a + b) and multiplication (denoted ab), satisfying the following axioms: 

(Al) a + b = b + a for all a,b £ R. 

(A2) (a + b) +c = a + (b + c) for all a,b,c € R. 

(A3) There exists 0 € R such that 0 + a = a for all a <G R. 

(A4) Given a € R, there exists —a € R such that a + (—a) = 0. 

(Ml) ab = bci for all a,b €R. 

(M2) (ab)c = a(bc) for all a, b, c G R. 

(M3) There exists 1 SS such that la = a for all a € R. 

(D) a(b + c) = ab + ac for all a,b,c £ R. 

(The standard definition of a ring omits (M3): with that condition, the standard 
term is ‘ring-with-1’ or ‘unital ring’ or various similar phrases. Since nearly all rings 
that we need have a 1, it seems simpler to require (M3). Occasionally, we dispense 
with it.) 

When we say that addition and multiplication are ‘operations’ on R , we auto¬ 
matically imply that if a,b £ R then a + b,ab € R, so R is ‘closed’ under each of 
these operations. Some axiom systems for rings include these conditions as explicit 
axioms. 

Axioms (Al) and (Ml) are the commutative laws for addition and multiplication, 
respectively. Axioms (A2) and (M2) are the associative laws for addition and multi¬ 
plication, respectively. Axiom (D) is the distributive law. The element 0 is called the 
additive identity or zero element, the element 1 is called the multiplicative identity 
or unity element. The element — a is the additive inverse or negative of a. The word 
‘the’ is justified here because 0 is unique, and for any given a € F the inverse —a is 
unique. The condition 1 ^ 0 in (M3) excludes the trivial ring with one element. 

The modern convention is that axioms (Ml) and (M3) are optional for rings. Any 
ring that satisfies (Ml) is said to be commutative, and any ring that satisfies (M3) is a 
ring with 1. However, in this book the phrase ‘commutative ring with 1 ’ is shortened 
to ‘ring’, because we do not require greater generality. 

Examples 16.2. (1) The classical number systems Z, Q, R, C are all rings. 

(2) The set of natural numbers N is not a ring, because axiom (A4) fails. 

(3) The set Z [/] of all complex numbers of the form a + bi, with a, b € Z, is a ring. 
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(4) The set of polynomials Z[t\ over Z is a ring, as the usual name ‘ring of polyno¬ 
mials’ indicates. 

(5) The set of polynomials Z[ti,... ,t n \ in n indeterminates over Z is a ring. 

( 6 ) If n is any integer, the set Z n of integers modulo n is a ring. 

If R is a ring, then we can define subtraction by 

a — b = a + (—b) a,b£R 

The axioms ensure that all of the usual algebraic rules of manipulation, except those 
for division, hold in any ring. 

Two extra axioms are required for a field: 

Definition 16.3. A field is a ring F satisfying the extra axiom 

(M4) Given a £ F, with a ^ 0, there exists a~ l £ F such that aa~ l = 1. 

(M4) 1^0. 

Without condition (M5) the set {0} would be a field with one element: this causes 
problems and is usually avoided. 

We call a 1 the multiplicative inverse of a / 0. This inverse also unique. If F is 
a field, then we can define division by 

a/b=ab~ l a,b£F,bf=0 

The axioms ensure that all the usual algebraic rules of manipulation, including those 
for division, hold in any field. 

Examples 16.4. (1) The classical number systems Q, R, C are all fields. 

(2) The set of integers Z is not a field, because axiom (M4) fails. 

(3) The set Q[i] of all complex numbers of the form a + bi, with a,b £ Q, is a field. 

(4) The set of polynomials Q[f] over Q is not a field, because axiom (M4) fails. 

(5) The set of rational functions Q(t) over Q is a field. 

( 6 ) The set of rational functions Q(fi,... ,t n ) in n indeterminates over Q is a field. 

(7) The set Z 2 of integers modulo 2 is a field. The multiplicative inverses of the only 
nonzero element 1 is 1 1 = 1. In this field, 1 + 1=0. So 1 + 1^0 does not count as 
one of the ‘usual laws of algebra’. Note that it involves an inequality; the statement 
1 + 1 = 2 is true in Z 2 . What is not true is that 2^0. 

( 8 ) The set Zg of integers modulo 6 is not a field, because axiom (M4) fails. In fact, 
the elements 2,3,4 do not have multiplicative inverses. Indeed, 2.3 = 0 but 2,3 7 ^ 0, a 
phenomenon that cannot occur in a field: if F is a field, and a.h f 0 in F but ab = 0, 
then a = abb " 1 = Ob 1 = 0, a contradiction. 

(9) The set Z 5 of integers modulo 5 is a field. The multiplicative inverses of the 
nonzero elements are 1 _1 = 1,2 _1 = 3,3 _1 = 2,4 _1 = 4. In this field, 1 + 1 + 1 + 
1 + 1 = 0 . 

(10) The set Z\ of integers modulo 1 is not a field. It consists of the single element 
0, and so violates (M3) which states that 1 / 0. This is a sensible convention since 1 
is not prime. 
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The fields Z 2 and Z 5 , or more generally Z p where p is prime (see Theorem 16.7 
below), are prototypes for an entirely new kind of field, with unusual properties. 
For example, the formula for solving quadratic equations fails spectacularly over Z 2 . 
Suppose that we want to solve 

t 2 T at + (? = 0 

where a,b £ Z 2 . Completing the square involves rewriting the equation in terms of 
(t + a/2). But a/2 = a/0, which makes no sense. The standard quadratic formula 
involves division by 2 and also makes no sense. Nevertheless, many choices of a,b 
here lead to soluble equations: 

t 2 = 0 has solution t = 0 
t 2 + 1 = 0 has solution t = 1 
t 2 +t = 0 has solutions t = 0,1 
t 2 +1 + 1 =0 has no solution 


16.2 General Properties of Rings and Fields 

We briefly develop some of the basic properties of rings and fields, with emphasis 
on structural features that will allow us to construct examples of fields. Among these 
features are the presence or absence of ‘divisors of zero’ (like 2,3 £ Zg), leading 
to the concept of an integral domain, and the notion of an ideal in a ring, leading to 
quotient rings and a general construction for interesting fields. Most readers will have 
encountered these ideas before; if not, it may be a good idea to find an introductory 
textbook and work through the first two or three chapters. For example, Fraleigh 
(1989) and Sharpe (1987) cover the relevant material. 

Definition 16.5. (1) A sub ring of a ring R is a non-empty subset S of R such that if 
a, b £ S then a + b £ S, a — b £ S, and ab £ S. 

Note that by this definition a subring need not satisfy (M3). This is one of the dis¬ 
advantages of simplifying ‘ring-with-1’ to ‘ring’. Perhaps we ought to define ‘ring- 
without-a- 1 ’. 

(2) A subfield of a field F is a subset S of F containing the elements 0 and 1, such 
that if a,b £ S then a + b, a — b, ab £ .S', and further i f « / 0 then a 1 £ S. 

(3) An ideal of a ring R is a subring I such that if i £ I and r £ R then ir and ri lie in 
I. 


Thus Z is a subring of Q, and K is a subfield of C, while the set 2Z of even 
integers is an ideal of Z. 

If R.S are rings, then a ring homomorphism 0 : R —> S is a map that satisfies three 
conditions: 

0(1) = 1 0(n+r2) = 0(n) + 0te) 0 (n>' 2 ) = 0(n)0(r 2 ) for all n , r 2 £ R 




General Properties of Rings and Fields 


185 


The kernel l<cr@ of & is { r : @(r) =0}. It is an ideal of R. An isomorphism is a 
homomorphism that is one-to-one and onto; a monomorphism is a homomorphism 
that is one-to-one. A homomorphism is a monomorphism if and only if its kernel is 
zero. 

The most important property of an ideal is the possibility of working modulo that 
ideal, or, more abstractly, constructing the ‘quotient ring’ by that ideal. Specifically, 
if 7 is an ideal of the ring R , then the quotient ring R/I consists of the cosets / + ,v of 
7 in R (considering R as a group under addition) The operations in the quotient ring 
are: 


(7 + r) + (7 + s) = I+(r + s) 

(7 + r) (7 + s) = 7+ (rs) 

where r, s G R and I + r is the coset {/ + r : i € 7}. 

Examples 16.6. (1) Let nL be the set of integers divisible by a fixed integer n. This 
is an ideal of Z, and the quotient ring Z„ = Z/nZ is the ring of integers modulo n, 
that is, Z„. 

(2) Let R = K\t] where K is a subfield of C, and let m(t) be an irreducible polynomial 
over K. Define I = ( m(t )) to be the set of all multiples of m(t). Then 7 is an ideal, 
and R/I is what we previously denoted by K\t\/(m) in Chapter 5. This quotient is a 
field. 

(3) We can perform the same construction as in Example 2, without taking m to be 
irreducible. We still get a quotient ring, but if m is reducible the quotient is no longer 
a field. 

When I is an ideal of R , there is a natural ring homomorphism 0 : R —> R/I, 
defined by 0 (r) = I + r. Its kernel is I. 

We shall need the following property of Z„, which explains the differences we 
found among Z 2 , Z 5 , and Zg. 

Theorem 16.7. The ring Z„ is a field if and only if n is a prime number. 

Proof First suppose that n is not prime. If n = 1, then Z„ = Z/Z, which has only one 
element and so cannot be a field. If n > 1 then n = rs where r and s are integers less 
than n. Putting I = nZ, 

(I + r)(I + s) = I + rs = I 

But I is the zero element of Z/7, while I + r and I + s are non-zero. Since in a field 
the product of two non-zero elements is non-zero, Z/7 cannot be a field. 

Now suppose that n is prime. Let 7 + r be a non-zero element of Z/7. Then r and 
n are coprime, so by standard properties of Z there exist integers a and b such that 
ar + bn = 1. Therefore 

(7 + a) (7 + r) = (7 + 1) — (7 + n) (7 + b) = 7 + 1 


and similarly 


(7 + r)(7 + a) — 7+1 
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Since I + 1 is the identity element of Z/7, we have found a multiplicative inverse for 
the given element I+ r. Thus every non-zero element of 'Lll has an inverse, so that 
L n = Z/7 is a field. □ 

From now on, when dealing with Z„, we revert to the usual convention and write 
the elements as 0,1,2,... ,n — 1 rather than I,I+l,I + 2,...,I + n— 1. 


16.3 Polynomials Over General Rings 

We now introduce polynomials with coefficients in a given ring. The main point 
to bear in mind is that identifying polynomials with functions, as we cheerfully did 
in Chapter 2 for coefficients in C, is no longer a good idea, because Proposition 2.3, 
which states that polynomials defining the same function are equal, need not be true 
when the coefficients belong to a general ring. 

Indeed, consider the ring L^. Suppose that f(t) = t 2 + 1 ,g(t) = t 4 + 1. There are 
numerous reasons to want these to be different polynomials, the most obvious being 
that they have different coefficients. But if we interpret them as functions from Z 2 to 
itself, we find that /(0) = 1 = g( 0) and /(1) = 0 = g(l). As functions, / and g are 
equal. 

It turns out that a problem arises here because the ring is finite. Since finite rings 
(especially finite fields) are important, we need a definition of ‘polynomials’ that does 
not rely on interpreting them as functions. We did this in Section 2.1 for polynomials 
over C, and the same idea works for any ring. 

To be specific, let A 1 be a ring. We define a polynomial over R in the indeterminate 
t to be an expression 

r Q + r d H- \~ r nt n 

where ro ,..., r n £ R, 0 < n € Z, and t is undefined. Again, for set-theoretic purity we 
can replace such an expression by the sequence (ro, ...,r n ), as in Exercise 2.2. The 
elements ro,..., r„ are the coefficients of the polynomial. 

Two polynomials are defined to be equal if and only if the corresponding co¬ 
efficients are equal (with the understanding that powers of t not occurring in the 
polynomial may be taken to have zero coefficient). The sum and the product of two 
polynomials are defined using the same formulas (2.3, 2.4) as in Section 2.1, but 
now the r, belong to a general ring. It is straightforward to check that the set of all 
polynomials over R in the indeterminate t is a ring—the ring of polynomials over R 
in the indeterminate 1. As before, we denote this by the symbol /?[?]. We can also 
define polynomials in several indeterminates t\. to. ... and obtain the polynomial ring 
/?[?!. f 2 . ■ ■ ■ | • Again, each polynomial / € R[t] defines a function from R to R. We use 
the same symbols /, to denote this function. If f(t) = £r,f' then /(a) = £r,-a', for 
a € R. We reiterate that two distinct polynomials over R may give rise to the same 
function on R. 
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Proposition 2.3 is still true when R = R,Q, or Z, with the same proof. And the 
definition of ‘degree’ applies without change, as does the proof of Proposition 2.2. 


16.4 The Characteristic of a Field 

In Proposition 4.4 we observed that every subfield of C must contain Q. The 

main step in the proof was that the sub field contains all elements 1 + 1 H-b 1 , that 

is, it contains N, hence Z, hence Q. 

The same idea nearly works for any field. However, a finite field such as Z 5 
cannot contain Q, or even anything isomorphic to Q, because Q is infinite. How does 
the proof fail? As we have already seen, in Z 5 the equation 1+1+1+1+1=0 
holds. So we can build up a unique smallest subfield just as before—but now it need 
not be isomorphic to Q. 

Pursuing this line of thought leads to: 

Definition 16.8. The prime subfield of a field K is the intersection of all subfields of 
K. 


It is easy to see that the intersection of any collection of subfields of K is a 
subfield (the intersection is not empty since every subfield contains 0 and 1 ), and 
therefore the prime subfield of K is the unique smallest subfield of K. The fields Q 
and Z p (p prime) have no proper subfields, so are equal to their prime subfields. The 
next theorem shows that these are the only fields that can occur as prime subfields. 

Theorem 16.9. For every field K, the prime subfield ofK is isomorphic either to the 
field Q of rationals or the field Z p of integers modulo a prime number p. 

Proof Let K be a field, P its prime subfield. Then P contains 0 and 1, and therefore 
contains the elements n*(n£ Z) defined by 


n 


* 


1 + 1 +... + 1 ( n times) if n > 0 
0 if n = 0 

— (—«)* if n < 0 


A short calculation using the distributive law (D) and induction shows that the 
map * : Z —>■ P so defined is a ring homomorphism. Two distinct cases arise. 

(1) n* = 0 for some n 7b 0. Since also (—n)* = 0, there exists a smallest positive 
integer p such that p* = 0. If p is composite, say p = rs where r and s are smaller 
positive integers, then r*s* = p* = 0, so either r = 0 or s* = 0, contrary to the 
definition of p. Therefore p is prime. The elements n* form a ring isomorphic to Z p , 
which is a field by Theorem 16.7. This must be the whole of P, since P is the smallest 
subfield of K. 

(2) n* / 0 i f n f 0. Then P must contain all the elements m* /n* where m, n are 

integers and n 7 b 0. These form a subfield isomorphic to Q (by the map which sends 
m*/n* to m/n) which is necessarily the whole of P. □ 
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The distinction among possible prime subfields is summed up by: 

Definition 16.10. The characteristic of a field K is 0 if the prime subfield of K is 
isomorphic to Q, and p if the prime subfield of K is isomorphic to Z p . 

For example, the fields Q, R, C all have characteristic zero, since in each case the 
prime subfield is Q. The field Z p ( p prime) has characteristic p. We shall see later 
that there are other fields of characteristic p: for an example, see Exercise 16.6. 

The elements n* defined in the proof of Theorem 16.9 are of considerable impor¬ 
tance in what follows. It is conventional to omit the asterisk and write n instead of 
n* . This abuse of notation will cause no confusion as long as it is understood that n 
may be zero in the field without being zero as an integer. Thus in Z 5 we have 10 = 0 
and 2 = 7 = —3. This difficulty does not arise in fields of characteristic zero. 

With this convention, a product nk (n £Z, k £ K) makes sense, and 

nk = ±(kH-h k) 

Lemma 16.11. IfK is a subfield ofL, then K and L have the same characteristic. 

Proof. In fact, K and L have the same prime subfield. □ 

Lemma 16.12. If k is a non-zero element of the field K, and if n is an integer such 
that nk = 0, then n is a multiple of the characteristic of K. 

Proof. We must have n = 0 in K. that is, in old notation, n* = 0. If the characteristic 
is 0, then this implies that n = 0 as an integer. If the characteristic is p > 0, then it 
implies that n is a multiple of p. □ 


16.5 Integral Domains 

The ring Z has an important property, which is shared by many of the other rings 
that we shall be studying: if mn = 0 where m.n are integers, then m = 0 or n = 0. We 
abstract this property as: 

Definition 16.13. A ring R is an integral domain if rs = 0, for r.s £ R. implies that 
r = 0 or s = 0. 

We often express this condition as ‘D has no zero-divisors’, where a zero-divisor 
is a non-zero element a £ D for which there exists a non-zero element b £ D such 
that ab = 0. 

Examples 16.14. (1) The integers Z form an integral domain. 

(2) Any field is an integral domain. For suppose K is a field and rs = 0. Then either 
s = 0, or r = rss~ l = Os -1 = 0. 

(3) The ring Z( is not an integral domain. As observed earlier, in this ring 2.3 = 0 
but 2 ,3^0. 
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(4) The polynomial ring Z[t] is an integral domain. If f{t)g{t) = 0 as polynomials, 
but f(t),g(t) f 0, then we can find an element x G Z such that f(x) f 0,g(x) f 0. 
(Just choose x different from the finite set of zeros of / together with zeros of g.) But 
then f{x)g(x) /= 0, a contradiction. 

It turns out that a ring is an integral domain if and only if it is (isomorphic to) 
a subring of some field. To understand how this comes about, we analyse when it 
is possible to embed a ring R in a field—that is, find a field containing a subring 
isomorphic to R. Thus Z can be embedded in Q. This particular example has the 
property that every element of Q is a fraction whose numerator and denominator lie 
in Z. We wish to generalise this situation. 

Definition 16.15. A field of fractions of the ring R is a field K containing a subring 
R' isomorphic to R, such that every element of K can be expressed in the form r/s 
for r, s £ R', where ,v / 0. 

To see how to construct a field of fractions for R , we analyse how Z is embedded 
in Q. We can think of a rational number, written as a fraction r/s, as an ordered pair 
(r, s) of integers. However, the same rational number corresponds to many distinct 
fractions: for instance f = g = and so on. Therefore the pairs (2, 3), (4, 6), and 
(10, 15) must be treated as if they are ‘the same’. The way to achieve this is to define 
an equivalence relation that makes them equivalent to each other. In general (r, s) 
represents the same rational as (t, u) if and only if r/s = t/u, that is, ru = st. In this 
form the condition involves only the arithmetic of Z. By generalising these ideas we 
obtain: 

Theorem 16.16. Every integral domain possesses afield of fractions. 

Proof Let R be an integral domain, and let S be the set of all ordered pairs (r. s) 
where r and .v lie in R and s / 0. Define a relation ~ on S by 

(r,s) ~ (. t,u ) •<=>■ ru = st 

It is easy to verify that ~ is an equivalence relation; we denote the equivalence class 
of (r, s) by \r, s]. The set F of equivalence classes will provide the required field of 
fractions. First we define the operations on F by 

[r,s] + [f,n] = [ru+ts,su] 

[r, j>] [r, w] = [rt,su] 

Then we perform a long series of computations to show that F has all the required 
properties. Since these computations are routine we shall not perform them here, but 
if you’ve never seen them, you should check them for yourself, see Exercise 16.7. 
What you have to prove is: 

(1) The operations are well defined. That is to say, if (r, s) ~ (V, s') and (t, u ) ~ 
(/, u'), then 

[r, s] + [f, h] = \r',s') + [f ,u'\ 

[r,s][/H] = [fi,s'][t',u'} 
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(2) They are operations on F (this is where we need to know that R is an integral 
domain). 

(3) F is a field. 

(4) The map R —> F which sends r —> \r, 1] is a monomorphism. 

(5) [r,s] = [r, l]/[s, 1], 


□ 

It can be shown (Exercise 16.8) that for a given integral domain R , all fields of 
fractions are isomorphic. We can therefore refer to the field constructed above as the 
field of fractions of R. It is customary to identify an element r £ R with its image 
[r, 1] in F, whereupon [r, s] = r/s. 

A short calculation reveals a useful property: 

Lemma 16.17. If R is an integral domain and t is an indeterminate, then Rt] is an 
integral domain. 

Proof. Suppose that 


/ — /0 + /1H- ffnt" g — gO+glM-T gml’" 

where /,, / 0 / g,„ and all the coefficients lie in R. The coefficient of t'" 1 " in fg is 
f n g m , which is non-zero since R is an integral domain. Thus if/, g are non-zero then 
fg is non-zero. This implies that R t] is an integral domain, as claimed. □ 

Corollary 16.18. If F is afield, then the polynomial ring F[t \,.../„] in n indetermi- 
nates is an integral domain for any n. 

Proof. Write F[ti,... ,t n ] = F\t\\ [t 2 , ■ ■ ■ ,t n \ and use induction. □ 

Proposition 2.2 applies to polynomials over any integral domain. 

Theorem 16.16 implies that when R is an integral domain, R \t j has a field of 
fractions. We call this the field of rational expressions in t over R and denote by R{t). 
Its elements are of the form p(t)/q(t) where p and q are polynomials and q is not the 
zero polynomial. Similarly /? [fi, ..., t„] has a field of fractions R(t\,..., t n )• Rational 
expressions can be considered as fractions p(t)/q(t), where p,q £ R[r] and q is not 
the zero polynomial. If we add two such fractions together, or multiply them, the 
result is another such fraction. In fact, by the usual rules of algebra. 


p(t) K0 

q(t) s(t) 
P(t) r(t) 
q(t) s(t) 


P(t)r( t) 

q{t)s{t) 

p{t)s{t) + q(t)r{t) 

q(t)s(t) 
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We can also divide and subtract such expressions: 

Pit),r(t) = p(t)s{t) 
q(t) 7 s(t) q{t)r{t) 

P{t) r(t ) = p{t)s(t) - q{t)r(t) 
q(t) s(t) q(t)s(t) 

where in the first equation we assume r(t) is not the zero polynomial. 

The Division Algorithm and the Euclidean Algorithm work for polynomials over 
any field, without change. Therefore the entire theory of factorisation of polynomials, 
including irreducibles, works for polynomials in whose coefficients lie in any 
field K. 


EXERCISES 

16.1 Show that 15Z is an ideal of 5 Z, and that 5Z/15Z is isomorphic to Z3. 

16.2 Are the rings Z and 2Z isomorphic? 

16.3 Write out addition and multiplication tables for Zg, Z7, and Which of these 
rings are integral domains? Which are fields? 

16.4 Define a prime field to be a field with no proper subfields. Show that the prime 
fields (up to isomorphism) are precisely Q and Z„ (p prime). 

16.5 Find the prime subfield of Q, R, C, Q(f), R(f), C(f), Z 5 (f), Zn(fi,f 2 )- 

16.6 Show that the following tables define a field. 

+ I 0 lafi ■ 0 1 a P 

001aj3 00000 

1 1 0 p a 1 0 1 a 

a a 1 3 0 1 a 0 a p \ 

P P a \ 0 p 0 p 1 a 

Find its prime subfield P. 

16.7 Prove properties (1-5) listed in the construction of the field of fractions of an 
integral domain in Theorem 16.16. 

16.8 Fet D be an integral domain with a field of fractions F. Fet K be any 

field. Prove that any monomorphism 0 : D —> K has a unique extension to 
a monomorphism t ff : F K defined by 


\ir{a/b) = <t>(a)/<t>(b) 
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for a, b £ D. By considering the case where K is another field of fractions for 
D and 0 is the inclusion map show that fields of fractions are unique up to 
isomorphism. 

16.9 Let K = 'Ll. Describe the subfields of K(t) of the form: 

(a) K(t 2 ) 

(b) K(t + 1) 

(c) K(t 5 ) 
fd) ^(f 2 + l) 

16.10 Does the condition d(f + g) < ma x(df,dg) hold for polynomials f.g over a 
general ring? 

By considering the polynomials 3 1 and 2 1 over 7L( show that the equality 
d ( fg ) = df + dg fails for polynomials over a general ring R. What if R is 
an integral domain? 

16.11 Mark the following true or false: 

(a) Every integral domain is a field. 

(b) Every field is an integral domain. 

(c) If F is a field, then F[t] is a field, 
fd) If F is a field, then F(t) is a field. 

(e) Z(f) is a field. 
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Having defined rings and fields, and equipped ourselves with several methods for 
constructing them, we are now in a position to attack the general structure of an 
abstract field extension. Our previous work with subfields of C paves the way, and 
most of the effort goes into making minor changes to terminology and checking 
carefully that the underlying ideas generalise in the obvious manner. 

We begin by extending the classification of simple extensions to general fields. 
Having done that, we assure ourselves that the theory of normal extensions, including 
their relation to splitting fields, carries over to the general case. A new issue, sepa¬ 
rability, comes into play when the characteristic of the field is not zero. The main 
result is that the Galois correspondence can be set up for any finite separable normal 
extension, and it then has exactly the same properties that we have already proved 
over C. 

Convention on Generalisations . Much of this chapter consists of routine veri¬ 
fication that theorems previously stated and proved for subfields or subrings of C 
remain valid for general rings and fields—and have essentially the same proofs. As 
a standing convention, we refer to ‘Lemma X.Y (generalised)’ to mean the generali¬ 
sation to an arbitrary ring or field of Lemma X.Y; usually we do not restate Lemma 
X.Y in its new form. In cases where the proof requires a new method, or extra hy¬ 
potheses, we will be more specific. Moreover, some of the most important theorems 
will be restated explicitly. 


17.1 Minimal Polynomials 

Definition 17.1. A field extension is a monomorphism I : K —> L, where K. L are 
fields. 

Usually we identify K with its image I ( K ), and in this case K becomes a subfield 
of L. 

We write L.K for an extension where K is a subfield of L. In this case, I is the 
inclusion map. 

We define the degree [L: K] of an extension L : K exactly as in Chapter 6. Namely, 
consider L as a vector space over K and take its dimension. The Tower Law remains 
valid and has exactly the same proof. 
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In Chapter 16 we observed that all of the usual properties of factorisation of poly¬ 
nomials over C carry over, without change, to general polynomials. (Even Gauss’s 
Lemma and Eisenstein’s Criterion can be generalised to polynomials over suitable 
rings, but we do not discuss such generalisations here.) Specifically, the definitions of 
reducible and irreducible polynomials, uniqueness of factorisation into irreducibles, 
and the concept of a highest common factor, or hcf, carry over to the general case. 
Moreover, if K is a field and h £ K\t\ is an hcf of f,g £ K[t], then there exist a.b £ K\t] 
such that h = af + bg. As before, a polynomial is monic if its term of highest degree 
has coefficient 1 . 

If L : K is a field extension and a £ L. the same dichotomy arises: either a is a 
zero of some polynomial f £ K\t], or it is not. In the first case a is algebraic over K; 
in the second case a is transcendental over K. 

An element a £L that is algebraic over K has a well-defined minimal polynomial 
m(t) £ K{t}\ this is the unique monic polynomial over K of smallest degree such that 
m(a) = 0. 


17.2 Simple Algebraic Extensions 

As before, we can define the subfield of L generated by a subset X C L, together 
with some subfield K , and we employ the same notation K(X) for this field. We say 
that it is obtained by adjoining X to K. The terms finitely generated extension and 
simple extension generalise without change. 

We mimic the classification of simple extensions in C of Chapter 5. Simple tran¬ 
scendental extensions are easy to analyse, and we obtain the same result: every sim¬ 
ple transcendental extension K (a) of K is isomorphic to K (t) : K , the field of rational 
expressions in one indeterminate t. Moreover, there is an isomorphism that carries t 
to a. 

The algebraic case is slightly trickier: again the key is irreducible polynomials. 
The result that opens up the whole area is: 

Theorem 17.2. Let K be afield and suppose that m £ K[t] is irreducible and monic. 
Let I be the ideal of K[t] consisting of all multiples of m. Then K[t]/I is afield, and 
there is a natural monomorphism l : K —> K[t]/I such that i(k) = I + k. Morover, 
I + k is a zero of m, which is its minimal polynomial. 

Proof. First, observe that I really is an ideal (Exercise 17.1). We know on general 
nonsense grounds that K[t]/I is a ring. So suppose that I + f £ K\t]/I is not the zero 
element, which in this case means that / qL I. Then / is not a multiple of m, and since 
m is irreducible, the hcf of / and m is 1. Therefore there exist a. h £ K[t] such that 
af + bm = 1. We claim that the multiplicative inverse of I + / is I + a. To prove this, 
compute: 


(/ + /)(/ + «) =/ + fa = / + (! -bm) =/+ 1 
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since bin £ I by definition. But 1+ 1 is the multiplicative identity of K\t\/I. Therefore 
K[t]/I is a field. 

Define t : K —> K[t]/I by i(k) = I + k. It is easy to check that I is a homomor¬ 
phism. We show that it is one-to-one. If a ^ b £ K then clearly a — b f (m), so 
l(a) l{b ). Therefore I is a monomorphism. □ 

It is easy to see that the minimal polynomial of I + t £ K[t]/I over K is m(t). 
Indeed, m(I + t) = I + m(t) =1 + 0. (This is the only place we use the fact that m is 
monic. But if m is irreducible and not monic, then some multiple km, with k £ K, is 
irreducible and monic; moreover, m and km determine the same ideal I.) 

This proof can be made more elegant and more general: see Exercise 17.2. We 
can (and do) identify K with its image l(K), so we can assume without loss of gen¬ 
erality that K C K\t\/I. We now prove a classification theorem for simple algebraic 
extensions: 

Theorem 17.3. Let K(a) : K be a simple algebraic extension, where a has minimal 
polynomial m over K. Then K{a) : K is isomorphic to K[t\/I: K, where I is the ideal 
of K[t] consisting of all multiples of m. Moreover, there is a natural isomorphism in 
which a i—>■ the coset I + t. 

Proof Define a map <j> : —► K(a) by (j>{f{t)) = /(a). This is clearly a ring ho¬ 

momorphism. Its image is the whole of K(a), and its kernel consists of all multiples 
of m(t) by Lemma 5.6 (generalised). Now K(a) = im(0) = K[t]/ker((j)) = K[t]/I, 
as required. □ 

We can now prove a preliminary version of the result that K and m between them 
determine the extension K(a). 

Theorem 17.4. Suppose K{a) : K and K(p) : K are simple algebraic extensions, 
such that a and ft have the same minimal polynomial m over K. Then the two exten¬ 
sions are isomorphic, and the isomorphism of the large fields can be taken to map a 
to p. 

Proof This is an immediate corollary of Theorem 17.3. □ 


17.3 Splitting Fields 

In Chapter 9 we defined the term ‘splitting field’: a polynomial / € K[t] splits in L 
if it can be expressed as a product of linear factors over L , and the splitting field E of 
/ is the smallest such L. There, we appealed to the Fundamental Theorem of Algebra 
to construct the splitting field for any given complex polynomial. In the general case, 
the Fundamental Theorem of Algebra is not available to us. (There is a version of 
it. Exercise 17.3, but in order to prove that version, we must be able to construct 
splitting fields without appealing to that version of the Fundamental Theorem of 
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Algebra.) And there is no longer a unique splitting field—though splitting fields are 
unique up to isomorphism. 

We start by generalising Definitions 9.1 and 9.3. 

Definition 17.5. If K is a field and / is a nonzero polynomial over K, then / splits 
over K if it can be expressed as a product of linear factors 

f{t)=k(t-a 1 )...(t-a n ) 

where k, a \,..., a„ € K. 

Definition 17.6. Let K be a field and let E be an extension of K. Then E is a splitting 
field for the polynomial / over K if 

(1) / splits over E. 

(2) If K C E' C E and / splits over E' then E' = E. 

Our aim is to show that for any field K , any polynomial over K has a splitting 
field E, and this splitting field is unique up to isomorphism of extensions. 

The work that we have already done allows us to construct, in the abstract, any 
simple extension of a field K. Specifically, any simple transcendental extension K(a) 
of K is isomorphic to the field K(t) of rational expressions in t over K. And if in £ K[t] 
is irreducible and monic, and I is the ideal of K[t] consisting of all multiples of m, 
then K[t]/I is a simple algebraic extension K(a.) of K where a = I + t has minimal 
polynomial m over K. Moreover, all simple algebraic extensions of K arise (up to 
isomorphism) by this construction. 

Definition 17.7. We refer to these constructions as adjoining a to K. 

When we were working with subfields K of C, we could assume that the ele¬ 
ments) being adjoined were in C, so all we had to do was take the field they gener¬ 
ate, together with K. Now we do not have a big field in which to work, so we have to 
create the fields along with the elements we need. 

We construct a splitting field by adjoining to K elements that are to be thought of 
as the zeros of /. We already know how to do this for irreducible polynomials, see 
Theorem 17.2, so we split / into irreducible factors and work on these separately. 

Theorem 17.8. If K is any field and f is any nonzero polynomial over K, then there 
exists a splitting field for f over K. 

Proof Use induction on the degree df. If df = 1 there is nothing to prove, for / 
splits over K. If / does not split over K then it has an irreducible factor f\ of degree 
> 1. Using Theorem 5.7 (generalised) we adjoin 0\ to K , where f\[o \) = 0. Then in 
K ( CT]) [f ] we have f ={t — C7i )g where dg = df — 1. By induction, there is a splitting 
field E for g over K{d\). But then E is clearly a splitting field for / over K. □ 

It would appear at first sight that we might construct different splitting fields for 
/ by varying the choice of irreducible factors. In fact splitting fields (for given / 
and K ) are unique up to isomorphism. The statements and proofs are exactly as in 
Lemma 9.5 and Theorem 9.6, and we do not repeat them here. 
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17.4 Normality 

As before, the key properties that drive the Galois correspondence are normality 
and separability. We discuss normality in this section, and separability in the next. 

Because we suppressed explicit use of ‘over C’ from our earlier definition, it 
remains seemingly unchanged: 

Definition 17.9. A field extension L.K is normal if every irreducible polynomial / 
over K that has at least one zero in L splits in L. 

So does the proof of the main result about normality and splitting fields: 

Theorem 17.10. A field extension L : K is normal and finite if and only if L is a 
splitting field for some polynomial over K. 

Proof. The same as for Theorem 9.9, except that ‘the splitting field’ becomes ‘a 
splitting field’. □ 

Finally we need to discuss the concept of a normal closure in the abstract context. 
For subfields of C the normal closure of an extension L : K is an extension N of L 
such that N : K is normal, and N is as small as possible subject to this condition. We 
proved existence by taking a suitable splitting field, yielding a normal extension of K 
containing L, and then finding the unique smallest subfield with those two properties. 

For abstract fields, we have to proceed in a similar but technically different man¬ 
ner. The proof of Theorem 11.6 still constructs a normal closure, because this is 
defined there using a splitting field, which we construct using Theorem 17.8. The 
only difference is that the normal closure is now unique up to isomorphism. That is, 
if Ni : K and N 2 : K are normal closures of L.K, then the extensions N\ : L and A?: L 
are isomorphic. This follows because splitting fields are unique up to isomorphism, 
as remarked immediately after Theorem 17.8. 


17.5 Separability 

We generalise Definition 9.10: 

Definition 17.11. An irreducible polynomial / over a field K is separable over K if 
it has no multiple zeros in a splitting field. 

Since the splitting field is unique up to isomorphism, it is irrelevant which split¬ 
ting field we use to check this property. 

Example 17.12. Consider fit) =t 2 +t + 1 over Z 2 . This time we cannot use C, so 
we must go back to the basic construction for a splitting field. The field Z 2 has two 
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elements, 0 and 1. We note that / is irreducible, so we may adjoin an element such 
that £ has minimal polynomial / over Z 2 . Then £ 2 + £ + 1 = 0 so that £ 2 = 1 + £ 
(remember, the characteristic is 2) and the elements 0,1, £, 1 + £ form a field. This 
follows from Theorem 5.10 (generalised). It can also be verified directly by working 
out addition and multiplication tables: 


+ 

0 

1 

c 

1 + C 

0 

0 

1 

c 

1 + C 

1 

1 

0 

1 + C 

c 

C 

C 

1 + C 

0 

1 

I + C 

i + C 

c 

1 

0 


0 

1 

c 

1 + C 

0 

0 

0 

0 

0 

1 

0 

1 

c 

1 + C 

C 

0 

C 

1 + C 

1 

1 + C 

0 

1 + C 

1 

c 


A typical calculation for the second table runs like this: 

C(i + C) = C + C 2 = C + C + i = 1 

Therefore 2L(C) is a field with four elements. Now / splits over 2L(C): 

t 2 + t +1 = (t-Q(t- 1-C) 

but over no smaller field. Hence 2^(0 is a splitting field for / over Z 2 . 

We have now reached the point at which the theory of fields of prime character¬ 
istic p starts to differ markedly from that for characteristic zero. A major difference 
is that separability (see Definition 9.10) can, and often does, fail. To investigate this 
phenomenon, we introduce a new term: 

Definition 17.13. An irreducible polynomial over a field K is inseparable over K if 
it is not separable over K. 

We are now ready to prove the existence of a very useful map. 

Lemma 17.14. Let K be a field of characteristic p > 0. Then the map (f> : K —> 
K defined by (j>(k) =kP (k £ K) is a field monomorphism. If K is finite, <j> is an 
automorphism. 

Proof. Let x, y € K. Then 

0 (xy) = ( xy) p = x p y p = <j> (x)0 (y) 

By the binomial theorem. 


<K* + y) = {x+y) p =x p + P x p ~ l y+ f £ j x p ~ 2 y 2 + ■ ■ ■ + pxy p ~ l +y p (17.1) 
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Since the characteristic is p. Lemma 3.21 implies that the sum in (17.1) reduces to 
its first and last terms, and 

<j)(x+y) =x p +y p = 0(x) + 0(y) 

We have now proved that <j> is a homomorphism. 

To show that 0 is one-to-one, suppose that 0(x) = 0(y). Then <j>(x — y) =0. So 
{x — y) p = 0, so x = y. Therefore 0 is a monomorphism. 

If K is finite, then any monomorphism K—yKis automatically onto by counting 
elements, so 0 is an automorphism in this case. □ 

Definition 17.15. If K is a field of characteristic p > 0, the map <j> : K —¥ K defined 
by 0(fc) = k p (k £ K ) is the Frobenius monomorphism or Frobenius map of K. When 
K is finite, 0 is called the Frobenius automorphism of K. 

If you try this on the field Z 5 , it turns out that 0 is the identity map, which 
is not very inspiring. The same goes for Z p for any prime p. But for the field of 
Example 17.12 we have 0(0) = 0, 0(1) = 1, 0(C) = 1 + £, 0(1 +£) = £, so that 0 
is not always the identity. 

Example 17.16. We use the Frobenius map to give an example of an inseparable 
polynomial. Let Kq = Z p for prime p. Let K = Kq(u) where u is transcendental over 
K(), and let 

f(t) =t p — m £ K[t] 

Let E be a splitting field for / over K , and let T be a zero of / in E. Then x p = u. 
Now use the Frobenius map: 

{t _ T)P = t P_ T P = t P_ u = f{t) 

Thus if a p — u = 0 then (o — x) p = 0 so that a = t; all the zeros of / in E are equal. 

It remains to show that / is irreducible over K. Suppose that / = gh where g.h £ 
K[t\, and g and h are monic and have lower degree than /. We must have g(t) = (t — 
z) s where 0 < ,v < p by uniqueness of factorisation. Hence the constant coefficient 
(— t) s of g lies in K. This implies that T £ K. for there exist integers a and b such that 
as + bp = 1, and since % as+b P g K it follows that T £ K. Then T = v(u)/w(u) where 
v,w £ Ko[u), so 

v(u) p — u(w(u)) p = 0 

But the terms of highest degree cannot cancel. Hence / is irreducible. 

The formal derivative Df of a polynomial / can be defined for any underlying 
field K: 

Definition 17.17. Suppose that K is a field, and let 

f(t ) = ao a it —L * * * — a n t n £ K\f\ 

Then the formal derivative of / is the polynomial 


It — 1 


Df = a\+ 2a2t H-b na„t‘ 



200 


Abstract Field Extensions 


Note that here the elements 2 ,...,n belong to K, not Z. In fact they are what we 
briefly wrote as 2 n* in the proof of Theorem 16.9. 

Lemma 9.13 states that a polynomial f ^ 0 has a multiple zero in a splitting field 
if and only if / and Df have a common factor of degree > 1. This lemma remains 
valid over any field, and has the same proof. Using the formal derivative, we can 
characterise inseparable irreducible polynomials: 

Proposition 17.18. If K is afield of characteristic 0, then every irreducible polyno¬ 
mial over K is separable over K. 

If K has characteristic p > 0, then an irreducible polynomial f over K is insep¬ 
arable if and only if 

fit) = k$ T k\fP + • • • + k r t rp 

where ko,...,k r £ K. 

Proof By Lemma 9.13 (generalised), an irreducible polynomial / over K is insepa¬ 
rable if and only if / and Df have a common factor of degree > 1. If so, then since 
/ is irreducible and Df has smaller degree than /, we must have Df = 0. Thus if 

fit) = ao H-b a m t m 

then na„ = 0 for all integers n > 0. For characteristic 0 this is equivalent to a„ - 0 
for all n. For characteristic p > 0 it is equivalent to a n - 0 if p does not divide n. Let 
kj = ajp, and the result follows. □ 

The condition on / for inseparability over fields of characteristic p can be ex¬ 
pressed by saying that only powers of t that are multiples of p occur. That is 
f{t) = g{t p ) for some polynomial g over K. 

We now define two more uses of the word ‘separable’. 

Definition 17.19. If L : K is an extension then an algebraic element a £ L is separa¬ 
ble over K if its minimal polynomial over K is separable over K. 

An algebraic extension L.K is a separable extension if every a £ L is separable 
over K. 

For algebraic extensions, separability carries over to intermediate fields. 

Lemma 17.20. Let L : K be a separable algebraic extension and let M be an inter¬ 
mediate field. Then M : K and L : M are separable. 

Proof. Clearly M : K is separable. Let a £ L. and let m K and m M be its minimal 
polynomials over K,M respectively. Now iiim\iiik in M[t ]. But a is separable over 
K so nix is separable over K, hence him is separable over M. Therefore L : M is a 
separable extension. □ 

We end this section by proving that an extension generated by the zeros of a 
separable polynomial is separable. To prove this, we first prove: 
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Lemma 17.21. Let L : K be a field extension where the fields have characteristic 
p, and let a G L be algebraic over K. Then a is separable over K if and only if 
K(a p ) = K(a). 

Proof Since a is a zero of t p — a p G K(a p )[t], which equals (t — a) p by the Frobe- 
nius map, the minimal polynomial of a over K(a p ) must divide (t — a) p and hence 
be ( t — a) s for some s < p. 

If a is separable over K then it is separable over K{a p ). Therefore (t — a) s has 
simple zeros, so s = 1. Therefore a G K(a p ), so K(a p ) = K(a). 

For the converse, suppose that a is inseparable over K. Then its minimal polyno¬ 
mial over K has the form g(t p ) for some g G K[t\. Thus a has degree pdg over K. In 
contrast, a p is a zero of g, which has smaller degree dg. Thus K(a p ) and Kia. ) have 
different degrees over K, so cannot be equal. □ 

Theorem 17.22. IfL : K is afield extension such that L is generated over K by a set 
of separable algebraic elements, then L.Kis separable. 

Proof We may assume that K has characteristic p. It is sufficient to prove that the 
set of elements of L that are separable over K is closed under addition, subtraction, 
multiplication, and division. (Indeed, subtraction and division are enough.) We give 
the proof for addition: the other cases are similar. 

Suppose that a,fi G L are separable over K. Observe that 

K(a + p,p)=K(a,p) = K(a p ,p p ) = K(a p + p p ,p p ) (17.2) 

using Lemma 17.21 for the middle equality. Now consider the towers 

KCK{a + p) CK{a + p,l}) 

KCK(a p + p p )CK(a p + p p ,pp) 

and consider the corresponding degrees. Apply the Frobenius map to minimal poly¬ 
nomials to see that 

[K(a p + p p ,p p ) : K(a p + fi p )} < [K{a + j3,j3): K{a + j3)] 
and 

[K(a p + p p ):K]<[K(a + p):K] 

However, 

[K(a p + fi p ,fi p ) : K] = [K(a + : K] 


by (17.2). Now the Tower Law implies that the above inequalities of degrees must 
actually be equalities. The result follows. □ 
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17.6 Galois Theory for Abstract Fields 

Finally, we can set up the Galois correspondence as in Chapter 12. Everything 
works, provided that we work with a normal separable field extension rather than just 
a normal one. As we remarked in that context, separability is automatic for subfields 
of C. So there should be no difficulty in reworking the theory in the more general 
context. 

Note in particular that Theorem 11.14 (generalised) requires separability for 
fields of prime characteristic. 

Because of its importance, we restate the Fundamental Theorem of Galois The¬ 
ory: 

Theorem 17.23 (Fundamental Theorem of Galois Theory, General Case). 

If L : K is a finite separable normal field extension, with Galois group G, and if 
& ^ are defined as before, then: 

(1) The Galois group G has order [L : K], 

(2) The maps * and ' are mutual inverses, and set up an order-reversing one-to- 
one correspondence between fp and ( S. 

(3) IfM is an intermediate field, then 

[L: M] = \M *| [M:K] = \G\/\M*\ 

(4) An intermediate field M is a normal extension ofK if and only ifM* is a normal 
subgroup of G. 

(5) If an intermediate field M is a normal extension ofK, then the Galois group of 
M : K is isomorphic to the quotient group G/M*. 

Proof Mimic the proof of Theorem 12.2 and look out for steps that require separa¬ 
bility. □ 

Another thing to look out for is the uniqueness of the splitting field of a polyno¬ 
mial: now it is unique only up to isomorphism. For example, we defined the Galois 
group of a polynomial / over K to be the Galois group of E : K, where E is the split¬ 
ting field of /. When K is a subfield of C, the subfield E is unique. In general it is 
unique up to isomorphism, so the Galois group of / is unique up to isomorphism. 
That suits us fine. 

What about radical extensions? In characteristic p, inseparability raises its ugly 
head, and its effect is serious. For example, t p — 1 = (? — l) p , by the Frobenius map, 
so the only pth root of unity is 1. The definition of ‘radical extension’ has to be 
changed in characteristic p, and we shall not go into the details. However, everything 
carries through unchanged to fields with characteristic 0. 

We have now reworked the entire theory established in previous chapters, gener¬ 
alising from sub fields of C to arbitrary fields. Now we can pick up the thread again, 
but from now on, the abstract formalism is there if we need it. 
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EXERCISES 

17.1 Let K be a field, and let /(f) G K[t], Prove that the set of all multiples of / is 
an ideal of /f[f], 

17.2 Let 0 : K —> R be a ring homomorphism, where K is a field and R is a ring. 
Prove that (j) is one-to-one. (Note that in this book rings have identity elements 
1 and homomorphisms preserve such elements.) 

17.3* Prove by transfinite induction that every field can be embedded in an alge¬ 
braically closed field, its algebraic closure. (Hint: Keep adjoining zeros of 
irreducible polynomials until there are none left.) 

17.4* Prove that algebraic closures are unique up to isomorphism. More strongly, if 
K is any field, and A. B are algebraic closures of K, show that the extensions 
A : K and B : K are isomorphic. 

17.5 Let A denote the set of all complex numbers that are algebraic over Q. The 
elements of A are called algebraic numbers. Show that A is a field, as follows. 

(a) Prove that a complex number a € A if and only if [Q(a): Q] < °°. 

(b) Let a, G A. Use the Tower Law to show that Q(a, J3): Q] < °°. 

(c) Use the Tower Law to show that [Q(a + j3): Q] < °°, [Q(— a): Q] < 

[Q(cejS): Q] < °°, and if a / 0 then [Q(a _1 ): Q] < °o. 

(d) Therefore A is a field. 

17.6 Prove that R[f]/(f 2 + 1) is isomorphic to C. 

17.7 Find the minimal polynomials over the small field of the following elements 
in the following extensions: 

(a) a in K: P where K is the field of Exercise 16.2 and P is its prime subfield. 

(b) a in 2 / (t)(cc) : 2/(f) where t is indeterminate and a 2 = t + 1 . 

17.8 For which of the following values of m(t) do there exist extensions K(a.) of K 
for which a has minimal polynomial m(t)l 

(a) m(t) = t 2 + 1 , K = Z 3 

(b) m(t) = t 2 + l,K = 

(c) m(t) = f — 3f 6 +4 / 3 — t — \,K = M 

17.9 Show that for fields for characteristic 2 there may exist quadratic equations 
that cannot be solved by adjoining square roots of elements in the field. (Hint: 
Try Z 2 .) 
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17.10 Show that we can solve quadratic equations over a field of characteristic 2 if 
as well as square roots we adjoin elements \fk defined to be solutions of the 
equation 

(s/k) 2 + = k. 

17.11 Show that the two zeros of t 2 +t — k = 0 in the previous question are y/k and 

i + Vk. 

17.12 Let K = Z3. Find all irreducible quadratics over K , and construct all possible 
extensions of K by an element with quadratic minimal polynomial. Into how 
many isomorphism classes do these extensions fall? How many elements do 
they have? 

17.13 Mark the following true or false. 

(a) The minimal polynomial over a field K of any element of an algebraic 
extension of K is irreducible over K. 

(b) Every monic irreducible polynomial over a field K can be the minimum 
polynomial of some element a in a simple algebraic extension of K. 

(c) A transcendental element does not have a mimimum polynomial. 

fd) Any field has infinitely many non-isomorphic simple transcendental ex¬ 
tensions. 

(e) Splitting fields for a given polynomial are unique. 

(f) Splitting fields for a given polynomial are unique up to isomorphism. 

(g) The polynomial f 6 — t 3 + 1 is separable over Z3. 



Chapter 18 

The General Polynomial Equation 


As we saw in Chapter 8 , the so-called ‘general’ polynomial is in fact very special. It is 
a polynomial whose coefficients do not satisfy any algebraic relations. This property 
makes it in some respects simpler to work with than, say, a polynomial over Q, and 
in particular it is easier to calculate its Galois group. As a result, we can show that 
the general quintic polynomial is not soluble by radicals without assuming as much 
group theory as we did in Chapter 15, and without having to prove the Theorem on 
Natural Irrationalities, Theorem 8.15. 

Chapter 15 makes it clear that the Galois group of the general polynomial of 
degree n should be the whole symmetric group §„, and we will show that this con¬ 
tention is correct. This immediately leads to the insolubility of the general quintic. 
Moreover, our knowledge of the structure of En, S3, and S4 can be used to find a 
unified method to solve the general quadratic, cubic, and quartic equations. Further 
work, not described here, leads to a method for solving any quintic that is soluble by 
radicals, and finding out whether this is the case: see Berndt, Spearman and Williams 
( 2002 ). 


18.1 Transcendence Degree 

Previously, we have avoided transcendental extensions. Indeed the assumption 
that extensions are finite has been central to the theory. We now need to consider a 
wider class of extensions, which still have a flavour of finiteness. 

Definition 18.1. An extension L : K is finitely generated if L = K(a\ .., a „) where 
n is finite. 

Here the a ; may be either algebraic or transcendental over K. 

Definition 18.2. If oq,..., a„ are transcendental elements over a field K , all lying 
inside some extension L of K , then they are independent if there is no non-trivial 
polynomial p € K[t\,...,t n ] such that p(oq,..., a„) = 0 in L. 

Thus, for example, if t is transcendental over K and u is transcendental over K(t), 
then K(t,u) is a finitely generated extension of K, and t,u are independent. On the 
other hand, t and u -1 2 - 1 are both transcendental over K , but are connected by the 
polynomial equation t 2 + 1 — u = 0, so are not independent. 
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We now prove a condition for a set to consist of independent transcendental ele¬ 
ments. 

Lemma 18.3. Let K CM be fields, a |a, CM, and suppose that «]..... a, _ | 
are independent transcendental elements over K. Then the following conditions are 
equivalent: 

(1) a, is transcendental over CCi,..., Ct,-_i 

(2) a\,...,a r are independent transcendental elements over K. 

Proof. We show that (1) is false if and only if (2) is false, which is equivalent to the 
above statement. 

Suppose (2) is false. Let p{t i,... ,t r ) £ K[t \,... ,t,] be a nonzero polynomial such 
that p(a= 0. Write p = W' =1 pfij where each pj C K\t\,...,t r -i], That 
is, think of p as a polynomial in t r with coefficients not evolving t r . Since p is 
nonzero, some pj must be nonzero. Because o.\,.... a r ] are independent transcen¬ 
dental elements over K , the polynomial pj remains nonzero when we substitute a, 
for t\, with 1 < i < r — 1. This substitution turns p into a nonzero polynomial over 
K(a.\ ,...,CL- 1 ) satisifed by a r , so (1) fails. 

The converse uses essentially the same idea. If (1) fails, then a r satisfies a poly¬ 
nomial in t r with coefficients in K( o. \,..., a, - \). Multiplying by the denominators 
of the coefficients we may assume the coefficients lie in K[a\,... ,i]• But now 
we have constructed a nonzero polynomial in K[t\.... ,t r ] satisfied by the a ; , so (2) 
fails. □ 

The next result describes the structure of a finitely generated extension. The main 
point is that we can adjoin a number of independent transcendental elements first, 
with algebraic ones coming afterwards. 

Lemma 18.4. If L : K is finitely generated, then there exists an intermediate field M 
such that 

(1) M = K{a \,..., CC r ) where the OCj are independent transcendental elements over 
K. 

(2) L.M is a finite extension. 

Proof. We know that L = K (/3j,..., j3„). If all the /i ; are algebraic over K , then L : K 
is finite by Lemma 6.11 (generalised) and we may take M = K. Otherwise some j3, 
is transcendental over K. Call this ofi. If L : K( a. \ ) is not finite, there exists some 
P/, transcendental over K(a.\). Call this 0 C 2 . We may continue this process until M = 
K(a u ..., a,) is such that L.M is finite. By Lemma 18.3, the a 7 are independent 
transcendental elements over K. □ 

A result due to Ernst Steinitz says that the integer r that gives the number of 
independent transcendental elements does not depend on the choice of M. 
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Lemma 18.5 (Steinitz Exchange Lemma). With the flotation of Lemma 18.4, if 
there is another intermediate field N = K{f \,... , As) such that j3i,... ,f s are inde¬ 
pendent transcendental elements over K and L : N is finite, then r = s. 

Proof The idea of the proof is that if there is a nontrivial polynomial relation involv¬ 
ing a, and /3 ; , then we can swap them, leaving the field concerned the same except 
for some finite extension. Inductively, we replace successive a, by /j, until all /j, have 
been used, proving that s < r. By symmetry, r < s and we are finished. 

The details require some care. We claim inductively on m, that: 

If 0 < m < s, then renumbering the (Xj if necessary, 

(1) L : K( J3i,.. .,Pm,a m + 1 ,• • •,a,-) is finite. 

(2) J3i,.... /j,„. a,„ i,..., a, are independent transcendental elements over K. 

The renumbering simplifies the notation, and is also carried out inductively. No 

a, is renumbered more than once. 

Claims (1, 2) are true when m = 0; in this case, no /j, occurs, and the conditions 
are the same as those in Lemma 18.4. 

Assuming (1, 2), we must prove the corresponding claims for m+ 1. To be ex¬ 
plicit, these are: 

(1') L : K(p u ..., j3„,+i, a m+2 ,. ■., a r ) is finite. 

( 2 ') j3j,..., An+i, a m+ 2 : ■ ■ ■, CC, are independent transcendental elements over K. 
We have m +1 < s, so A»+i exists. It is algebraic over K(f \,..., j3,„, a,„+i,..., a r ) 
by (1). Therefore there is some polynomial equation 

P { ft I ... 5 A/1+1 ‘ t-L; ■ I ■ ... , CCy) =0 (18.1) 

in which both A«+t and some a,- actually occur. (That is, each appears in some term 
with a nonzero coefficient.) Renumbering if necessary, we can assume that this 0Cj is 
OCm+i. Define four fields: 

Kq = K (/3l . . ., fin ■ 17 tXm-v 17 • • • 7 

A"l - K(fi\ ... 7 All 7 ^*“117+1 7 * * * 7 OJ/‘) 

A/7 = . . . , All+l, CTlll+27 * * ’ 7 O-r) 

^3 = ^(/3l ..., fm 7 ^m+2 7***7 A*r) 

Then /f 3 C /f,, /f, C /f 2 , /f, C K 2 CK Q . 

To prove (1'), observe that A'r, 7) A", and L : K\ is finite by (2), so L: Kq is finite. 
But Kq : K 2 is finite by (18.1). By the Tower Law, L :K 2 is finite. This is (2'). 

To prove (2'), suppose it is false. Then there is a polynomial equation 

P { f I • • • 7 All+l 7 ^*171+2 7 * * * 7 ^*l) = ^ 

The element An+i actually occurs in some nonzero term, otherwise (2) is false. 
Therefore j3„,+i is algebraic over /Cj, so 7C : K 2 is finite, so L : K\ is finite by (T) 
which we have already proved. Therefore K\ : K 2 is finite, but this contradicts (1). 

This completes the induction. Continuing up to m = s we deduce that s < r. 
Similarly r < s, so r = s. □ 
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Definition 18.6. The integer r defined in Lemma 15.1 is the transcendence degree 
of L .K. By Lemma 18.5, the value of r is well-defined. 

For example consider K{t , a, u ): K , where t is transcendental over K , a 2 = t, and 
u is transcendental over K(t,a). Then M = K(t,u) where t and u are independent 
transcendental elements over K , and 

K(t,a,u): M = M{a ): M 

is finite. The transcendence degree is 2. 

The degree [L : M] of the algebraic part is not an invariant, see Exercise 18.3. 

It is straightforward to show that an extension K ( a. i,..., a,-) : K by independent 
transcendental elements a, is isomorphic to K(t\.... ,t r ): K where K(t\.... ,t r ) is the 
field of rational expressions in the indeterminates t,. In consequence: 

Proposition 18.7. A finitely generated extension L : K has transcendence degree r if 
and only if there is an intermediate field M such that L is a finite extension ofM and 
M : K is isomorphic to K{t \,... ,t r ) ■ K. 

Corollary 18.8. if L : K is a finitely generated extension, and E is a finite extension 
ofL, then the transcendence degrees ofE and L over K are equal. 


18.2 Elementary Symmetric Polynomials 

Usually we are given a polynomial and wish to find its zeros. But it is also pos¬ 
sible to work in the opposite direction: given the zeros and their multiplicities, re¬ 
construct the polynomial. This is a far easier problem which has a complete general 
solution, as we saw in Section 8.7 for complex polynomials. We recap the main ideas. 

Consider a monic polynomial of degree n having its full quota of n zeros (count¬ 
ing multiplicities). It is therefore a product of n linear factors 


f{t) = {t - a x ).. .{t - a n ) 


where the a, are the zeros in K (not necessarily distinct). Suppose that 
/"(f) — a@ T ait -(- • • • -{- a n — \t n ^-t -t n 

If we expand the first product and equate coefficients with the second expression, we 
get the expected result: 


a n -1 — — (oci H-h Ct„) 

a n -2 = (aia 2 + aia 3 H-ba„_ia„) 


ao = (-l)"aia 2 ...a„ 
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The expressions in a \,..., a„ on the right are the elementary symmetric polynomials 
of Chapter 8 , but now they are more generally interpreted as elements of K[t\ ,..., t n \ 
and evaluated at tj = (Xj, for 1 < j < n. 

The elementary symmetric polynomials are symmetric in the sense that they are 
unchanged by permuting the indeterminates tj. This property suggests: 

Definition 18.9. A polynomial q € K[t \,... ,t n ] is symmetric if 

1) i • ■ ■ da(n )) ^(fi,... ,f«) 

for all permutations o £ S„. 

There are other symmetric polynomials apart from the elementary ones, for ex¬ 
ample t\ H-b /“, but they can all be expressed in terms of elementary symmetric 

polynomials: 

Theorem 18.10. Over a field K, any symmetric polynomial in t\,...,t n can be ex¬ 
pressed as a polynomial of smaller or equal degree in the elementary symmetric 
polynomials s r (t i,... ,t„){r = 0 , .. .,n). 

Proof. See Exercise 8.4 (generalised to any field). □ 

A slightly weaker version of this result is proved in Corollary 18.12. We need 
Theorem 18.10 to prove that K is transcendental (Chapter 24). The quickest proof 
of Theorem 18.10 is by induction, and full details can be found in any of the older 
algebra texts (such as Salmon 1885 page 57, Van der Waerden 1953 page 81). 


18.3 The General Polynomial 

Let K be any field, and let t\.. ,t n be independent transcendental elements over 
K. The symmetric group §„ can be made to act as a group of /("-automorphisms of 
K(t\,... ,t n ) by defining 

= ta(i) 

for all ff £ §„, and extending any rational expressions 0 by defining 

(/l j • • •;/«)) 0 ^<r(l)) • • • 'G(n)) 

It is easy to prove that cr, extended in this way, is a K -automorphism. 

For example, if n = 4 and a is the permutation 

/ 1234 \ 

\ 2431 ) 

then cr(fi) = Pi. oih) = G, G(h) = an d g{G) = 0- Moreover, as a typical case, 

/ t\u \ _ tlt\ 

' V f 2 4 -7f 3 y t\-lG 


a 
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Clearly distinct elements of §„ give rise to distinct /^-automorphisms. 

The fixed field F of S n obviously contains all the symmetric polynomials in the f,, 
and in particular the elementary symmetric polynomials s r = s r (t\ We show 

that these generate F. 

Lemma 18.11. With the above notation, F = K(s \,... ,s„). Moreover, 

[K(ti,...,t n ) :K(s U ---,s n )} =n\ (18.2) 

Proof. Clearly L = K{t \,...,/«) is a splitting field of /(f) over both K(s i,... ,s n ) and 
the possibly larger field F. Since §„ fixes both of these fields, the Galois group of each 
extension contains S„, so must equal §„. Therefore the fields F and K(s\ ,..., s n ) are 
equal. Equation (18.2) follows by the Galois correspondence. □ 

Corollary 18.12. Every symmetric polynomial in t\,...,t n over K can be written as 
a rational expression in si,...,s„. 

Proof. By definition, symmetric polynomials are precisely those that lie inside the 
fixed field F of By Lemma 18.11, F = K[s \,... ,s n ). □ 

Compare this result with Theorem 18.10. 

Lemma 18.13. With the above notation, si,...,s n are independent transcendental 
elements over K. 

Proof. By 18.2, K(ti,... ,t„) is a finite extension of K(s i,... ,s n ). By Corollary 18.8 
they both have the same transcendence degree over K, namely n. Therefore the Sj 
are independent, for otherwise the transcendence degree of K(s i,... ,s„) : K would 
be smaller than n. □ 

Definition 18.14. Let K be a field and let ,v |,... ,s„ be independent transcendental 
elements over K. The general polynomial of degree n ‘over’ K is the polynomial 

t n -S l t n ~ l +S2t n - 2 - + 

over the field K(s i,..., s„). 

The quotation marks are used because technically the polynomial is over the field 
K(s i,.. .,s ,,), not over K. 

Theorem 18.15. For any field K let g be the general polynomial of degree n ‘over’ 
K, and let E be a splitting field for g over K(s i,... Then the zeros t\,... ,t„ of 
g in E are independent transcendental elements over K, and the Galois group of 
E : K(si ,... ,s n ) is the symmetric group §„. 

Proof. The extension E : K(si,... ,s n ) is finite by Theorem 9.9, so the transcendence 
degree of E : K is equal to that ofK(si,...,s„) : K, namely n. Since Y. = K(ti,...,t n ), 
the tj are independent transcendental elements over K, since any algebraic relation 
between them would lower the transcendence degree. The Sj are now the elemen¬ 
tary symmetric polynomials in t\,...,t n by Theorem 18.10. As above, S„ acts as a 
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group of automorphisms of E = K{t\,t n ), and by Lemma 15.3 the fixed field is 
K(s i,.... s n ). By Theorem 11.14, E : K(s\....,s„ ) is separable and normal (normal¬ 
ity also follows from the definition of E as a splitting field), and by Theorem 10.5 its 
degree is |S„| = n\. Then by Theorem 17.23(1) the Galois group has order n\, and 
contains §„, so it equals S„. □ 

Theorem 15.8 and Corollary 14.8 imply: 

Theorem 18.16. IfK is afield of characteristic zero and n > 5, the general polyno¬ 
mial of degree n ‘over’ K is not soluble by radicals. 


18.4 Cyclic Extensions 

Theorem 18.16 does not imply that any particular polynomial over K of degree 
n > 5 is not soluble by radicals, because the general polynomial ‘over’ K is actually a 
polynomial over the extension field K(s i,..., s n ), with n independent transcendental 
elements Sj. For example, the theorem does not rule out the possibility that every 
quintic over ‘K might be soluble by radicals, but that the formula involved varies so 
much from case to case that no ‘general’ formula holds. 

However, when the general polynomial of degree n ‘over’ K can be solved by 
radicals, it is easy to deduce a solution by radicals of any polynomial of degree n 
over K , by substituting elements of K for si,..., s„ in that solution. This is the source 
of the ‘generality’ of the general polynomial. From Theorem 18.16, the best that we 
can hope for using radicals is a solution of polynomials of degree < 4. We fulfil this 
hope by analysing the structure of S„ for n < 4, and appealing to a converse to The¬ 
orem 15.8. This converse is proved by showing that ‘cyclic extensions’—extensions 
with cyclic Galois group—are closely linked to radicals. 

Definition 18.17. Let L.K be a finite normal extension with Galois group G. The 
norm of an element a £ Lis 


N(a) = Tj(a)Ti(a)... X n (a) 
where Ti,..., T„ are the elements of G. 

Clearly N(a) lies in the fixed field of G (use Lemma 10.4) so if the extension is 
also separable, then N(a) £ K. 

The next result is traditionally referred to as Hilbert’s Theorem 90 from its ap¬ 
pearance in his 1893 report on algebraic numbers. 

Theorem 18.18 (Hilbert’s Theorem 90). Let L.Kbe a finite normal extension with 
cyclic Galois group G generated by an element X. Then a £ L has norm N(a) = 1 if 
and only if 

a = b/x(b) 

for some b £ L, where b ^ 0. 
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Proof. Let |G| = n. If a = b/x{b) and b yXO then 

N(a) = ax(a)x 2 {a)... T n- 1 (a) 

b x{b) x 2 (b) X n ~ 1 (b) 

x(b) x 2 (b) x 3 (b) x"(b) 

= 1 


since x n = 1 . 

Conversely, suppose that N(a) = 1. Let c <£ L, and define 
do = ac 

d\ = (ax(a))x(c) 

dj = \ax{a)... x'(a)]x'{c) 


for 0 < j < n — 1. Then 


d n - l =N{a)x n ~\c) = x n -\c) 

Further, 

dj + 1 = ax(dj) (0 < j < n — 2) 

Define 


b — do+d\-\ - 1 - d, j—] 

We choose c to make b Suppose on the contrary that b = 0 for all choices of c. 
Then for any c£l 


AoT°(c) + Ait(c)h - 1 (c) = 0 


where 

Xj = ax (a)... x J (a) 

belongs to L. Hence the distinct automorphisms X J are linearly dependent over L, 
contrary to Lemma 10.1. 

Therefore we can choose c so that b f 0. But now 

x (b) = x(d 0 )-\ - hx(d„-i) 

= (l/a)(di H- -\-d n — t) + x n (c) 

= (l/a)(do~l -F d n — i) 

= b/a 


Thus a = b/x(b) as claimed. □ 

Theorem 18.19. Suppose that L : K is a finite separable normal extension whose 
Galois group G is cyclic of prime order p, generated by X. Assume that the charac¬ 
teristic of K is 0 or is prime to p, and that t p — 1 splits in K. Then L = K{tt), where 
a is a zero of an irreducible polynomial t p — a over Kfor some a € K. 
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Proof. The p zeros of t p — 1 from a group of order p, which must therefore be cyclic, 
since any group of prime order is cyclic. Because a cyclic group consists of powers 
of a single element, the zeros of t p — 1 are the powers of some £ £ K where £ p = 1. 
But then 

N(e) = £...£ = 1 

since £ £ K, so T'(e) = e for allBy Theorem 18.18, £ = a/x(a) for some a £ L. 
Therefore 

T(a) = £ _1 a x 2 (a) = e~ 2 a ... x i {a) = £~ i a 

and a = a p is fixed by G, so lies in K. Now Kia) is a splitting field for t p —a over 
K. The //-automorphisms 1, T,..., X p 1 map a to distinct elements, so they give p 
distinct //-automorphisms of K(a). By Theorem 17.23(1) the degree [ K(a ): K] > p. 
But [L: //] = |G| = p, so L = K(a). Hence t p — a is the minimal polynomial of a over 
K , otherwise we would have [//(a) : K] < p. Being a minimal polynomial, t p — a is 
irreducible over K. □ 

We can now prove the promised converse to Theorem 15.8. Compare with 
Lemma 8.17(2). 

Theorem 18.20. Let K be afield of characteristic 0 and let L: K be a finite normal 
extension with soluble Galois group G. Then there exists an extension R of L such 
that R : K is radical. 

Proof. All extensions are separable since the characteristic is 0. Use induction on 
|G|. The result is clear when G = 1. If |G| f 1, consider a maximal proper normal 
subgroup H of G, which exists since G is a finite group. Then G/H is simple, since H 
is maximal, and is also soluble by Theorem 14.4(2). By Theorem 14.6, G/H is cyclic 
of prime order p. Let A be a splitting field over L of t p — 1. Then N : K is normal, for 
by Theorem 9.9 L is a splitting field over K of some polynomial /, so A is a splitting 
field over L of (t p — 1)/, which implies that A : K is normal by Theorem 9.9. 

The Galois group of A : L is abelian by Lemma 15.6, and by Theorem 17.23(5) 
r(L : K ) is isomorphic to T(A : K)/T(N : L). By Theorem 14.4(3), T(A : K ) is solu¬ 
ble. Let M be the subfield of A generated by K and the zeros of t p — 1. Then A : M is 
normal. Now M : K is clearly radical, and since LCN the desired result will follow 
provided we can find an extension R of A such that R : M is radical. 

We claim that the Galois group of A : M is isomorphic to a subgroup of G. Let us 
map any M-automorphism x of A into its restriction T | /.. Since L.K is normal, T | / is 
a //-automorphism of L, and there is a group homomorphism 

0 : r(A: M) —> T(L: K). 

If x £ ker (0) then x fixes all elements of M and L, which generate A. Therefore 
X = 1, so (j) is a monomorphism, which implies that T(A : M) is isomorphic to a 
subgroup J of Y{L : K). 

If J = 0 (r(A : M )) is a proper subgroup of G, then by induction there is an 
extension R of A such that R : M is radical. 
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The remaining possibility is that J = G. Then we can find a subgroup H <3 F(N : 
M) of index p, namely H = Let P be the fixed field H 1 . Then [P:M\=p by 

Theorem 17.23(3), P : M is normal by Theorem 17.23(4), and t p — 1 splits in M. By 
Theorem 18.19 (generalised), P = M(a) where a p = a € M. But N : / J is a normal 
extension with soluble Galois group of order smaller than |G|, so by induction there 
exists an extension R of IV such that R.P is radical. But then R.M is radical, and the 
theorem is proved. □ 

To extend this result to fields of characteristic p > 0, radical extensions must be 
defined differently. As well as adjoining elements a such that a" lies in the given 
field, we must also allow adjunction of elements a such that a p — a lies in the given 
field (where p is the same as the characteristic). It is then true that a polynomial is 
soluble by radicals if and only if its Galois group is soluble. The proof is different 
because we have to consider extensions of degree p over fields of characteristic p. 
Then Theorem 18.19 (generalised) breaks down, and extensions of the second type 
above come in. If we do not modify the definition of solubility by radicals then 
although every soluble polynomial has soluble group, the converse need not hold— 
indeed some quadratic polynomials with abelian Galois group are not soluble by 
radicals, see Exercises 18.13 and 18.14. 

Since a splitting field is always a normal extension, we have: 

Theorem 18.21. Over a field of characteristic zero, a polynomial is soluble by rad¬ 
icals if and only if it has a soluble Galois group. 

Proof. Use Theorems 15.8 and 18.20. 

□ 


18.5 Solving Equations of Degree Four or Less 

The general polynomial of degree n has Galois group §„, and we know that for 
n <4 this is soluble (Chapter 14). Theorem 18.21 therefore implies that for a field K 
of characteristic zero, the general polynomial of degree < 4 can be solved by radicals. 
We already know this from the classical tricks in Chapter 1, but now we can use the 
structure of the symmetric group to explain, in a unified way, why those tricks work. 

Linear Equations 

The general linear polynomial is 


t — .?! 


Trivially t\ = ,v | is a zero. 

The Galois group here is trivial, and adds little to the discussion except to confirm 
that the zero must lie in K. 
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Quadratic Equations 

The general quadratic polynomial is 

t 2 — S\t +S2 

Let the zeros be t\ and t 2 . The Galois group S 2 consists of the identity and a map 
interchanging t\ and t 2 . By Hilbert’s Theorem 90, Theorem 18.18, there must ex¬ 
ist an element which, when acted on by the nontrivial element of §2, is multiplied 
by a primitive square root of 1; that is, by —1. Obviously t\ — 1 2 has this property. 
Therefore 

(O — t 2 ) 2 

is fixed by S2, so lies in K(s \ ,s 2 ). By explicit calculation 

(0 -t2) 2 =s\-As 2 


Hence 


t\ -t 2 = ± \Js\ - 4.S'2 
tl+t 2 = H 


and we have the familiar formula 


h,t 2 = 


si ± 



Cubic Equations 

The general cubic polynomial is 

f 3 — s\t 2 +s 2 t — S3 

Let the zeros be t\ T2T3. The Galois group §3 has a series 

1<A 3 <1S3 


with abelian quotients. 

Motivated once more by Hilbert’s Theorem 90, Theorem 18.18, we adjoin an 
element co 7 ^ 1 such that ft ) 3 = 1. Consider 

y = t\ + cot 2 + 0 ) 2 tj 

The elements of A3 permute t \, t 2 , and f 3 cyclically, and therefore multiply y by a 
power of ft). Hence y 3 is fixed by A3. Similarly if 

z = h + ort 2 + cot 3 

then z 3 is fixed by A3. Now any odd permutation in S3 interchanges y 3 and z 3 , so 
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thaty 3 +z 3 and y 3 z 3 are fixed by the whole of § 3 , hence lie in K(s\,S 2 ,sj). (Explicit 
formulas are given in the final section of this chapter.) Hence y 3 and z 3 are zeros of a 
quadratic over K(si,S 2 ,S 3 ) which can be solved as in part (b). Taking cube roots we 
know y and z. But since 

si = t\ +t 2 +h 


it follows that 


h = j(si +y + z) 
h = j(si +© 2 y + mz) 
h = j(si +coy + (o 2 z) 


Quartic Equations 

The general quartic polynomial is 

f 4 — Slf 3 + S 2 t 2 ~ S^t + S4 

Let the zeros be t\ J 2 J 3 J 4 . The Galois group §4 has a series 

I<IV<A4<S4 


with abelian quotients, where 

V = { 1 , ( 12 )(34), (13)(24), (14)(23)} 

is the Klein four-group. It is therefore natural to consider the three expressions 

yi = (L+h)(G+H) 
yi = (L +G)(G +H) 

J3 = (L +h){t2 +G) 

These are permuted among themselves by any permutation in § 4 , so that all the ele¬ 
mentary symmetric polynomials in y \ ,>’ 2,>’3 lie in K(s\, 52 , 53 , 54 ). (Explicit formulas 
are indicated below). Then >’ 1 , >’ 2 , >’3 are the zeros of a certain cubic polynomial over 
A(si, 52 , 53 , 54 ) called the resolvent cubic. Since 

t\ +?2 +?3 +?4 = 5j 

we can find three quadratic polynomials whose zeros are t\ -K 2 and t;\ +f 4 , t\ and 
to + 14 , t\ +t 4 and t 2 + ? 3 . From these it is easy to find ti,t 2 ,h,t 4 . 

Explicit Formulas 

For completeness, we now state, for degrees 3 and 4, the explicit formulas whose 
existence is alluded to above. Figure 24 shows a picture of Cardano, who first pub¬ 
lished them. For details of the calculations, see Van der Waerden (1953, pages 177- 
182). Compare with Chapter 1 Section 1.4. 



Solving Equations of Degree Four or Less 


111 


Cubic. The Tschirnhaus transformation 

U=t — jil 

converts the general cubic polynomial to 

u + pu + q 

If we can find the zeros of this it is an easy matter to find them for the general cubic. 
The above procedure for this polynomial leads to 

y 3 +z 3 = -21 q 
yV = - 21 P 3 

implying that y 3 and z 3 are the zeros of the quadratic polynomial 

t 2 + 21 qt — 21 p 3 

This yields Cardano’s formula (1.8). 

Quartic. The Tschirnhaus transformation 

u = t — 

reduces the quartic to the form 

t 4 + pt 2 + qt + r 

In the above procedure, 

y i +yi+y3 = 2 p 
ym+ym+yiyi = p 2 -4r 

}’ I }’ 2}’3 = -q 2 

The resolvent cubic is 

t 3 - 2 pt 2 + {p 2 - 4 r)t + q 2 

(a thinly disguised form of (1.12) with t = —2 m ). Its zeros are y i ->’2;>’3, and 

h = kiV-yt+V^n+V-yi) 
h = 5(’/ = >h -\f-n- V^) 
h = H-V^yi+V^yi- v /= y3) 
u = U-V^yi-V^n+V^M) 

Here the signs of the square roots must be chosen so that 

V^yvsf-nsF+n = -q 
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FIGURE 24: Cardano, the first person to publish solutions of cubic and quartic equa¬ 
tions. 


EXERCISES 

18.1 If K is a countable field and L : K is finitely generated, show that L is countable. 
Hence show that R: Q and C : Q are not finitely generated. 

18.2 Calculate the transcendence degrees of the following extensions: 

(a) Q(t,u,v,w) : Q where t,u,v,w are independent transcendental elements 
over Q. 

(b) Q(f ,m, v,w): Q where t 2 =2,u is transcendental over Q(f), v 3 =t+ 5, and 
w is transcendental over Q(t,u,v). 

(c) Q(t,u,v): Q where t 2 = u 3 = v 4 = 7. 

18.3 Show that in Lemma 18.4 the degree [L : M] is not independent of the choice 
of M. (Hint: Consider K(t 2 ) as a subfield of K(t).) 

18.4 Suppose that K C L C M, and each of M:K, L.K is finitely generated. Show 
that M : K and L : K have the same transcendence degree if and only if M : L is 
finite. 


18.5* For any field K show that f 3 — tx+ 1 is either irreducible or splits in K. (Hint: 
Show that any zero is a rational expression in any other zero.) 
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18.6 Suppose that L : K is finite, normal, and separable with Galois group G. For 
any a £ L define the trace 

T (a) = Ti ( a ) H-+ T„ (a) 

where Ti,..., T„ are the distinct elements of G. Show that T(a) £ K and that T 
is a surjective map L—^K. 

18.7 If in the previous exercise G is cyclic with generator T, show that T{a) = 0 if 
and only if a = b — T (b) for some b £ L. 

18.8 Solve by radicals the following polynomial equations over Q: 

(a) f 3 — 7f + 5 = 0 

(b) t 3 — It + 6 = 0 

(c) t 4 + 5t 3 — 2t—l = 0 

(d) f 4 + 4f + 2 = 0 

18.9 Show that a finitely generated algebraic extension is finite, and hence find an 
algebraic extension that is not finitely generated. 

18.10* Let d have minimal polynomial 

t 3 + at" + bt + c 

over Q. Find necessary and sufficient conditions in terms of a.b.c such that 
6 = (j) 2 where 0 £ Q( 0). (Hint: Consider the minimal polynomial of 0.) Hence 
or otherwise express </2& — 3 as a square in Q(and as a square 

in Q(-^5, %2). (See Ramanujan 1962 page 329.) 

18.11 Let r be a finite group of automorphisms of K with fixed field Kq. Let t be 
transcendental over K. For each a £ T show there is a unique automorphism 
a' of K(t) such that 

a’(k) = o(k) (k £ K) 
o’{t) = t 

Show that the a' form a group T' isomorphic to T, with fixed field K+ t ). 

18.12 Let K be a field of characteristic p. Suppose that f(t) = t p — t — a £ K[t}. If 
p is a zero of /, show that the zeros of / are p + k where k = 0,1 ,..., p — 1 . 
Deduce that either / is irreducible over K or / splits in K. 

18.13* If / in Exercise 18.13 is irreducible over K , show that the Galois group of / is 
cyclic. State and prove a characterisation of finite normal separable extensions 
with soluble Galois group in characteristic p. 

18.14 Mark the following true or false. 
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(a) Every finite extension is finitely generated. 

(b) Every finitely generated extension is algebraic. 

(c) The transcendence degree of a finitely generated extension is invariant 
under isomorphism. 

fd) If are independent transcendental elements, then their elemen¬ 

tary symmetric polynomials are also independent transcendental ele¬ 
ments. 

(e) The Galois group of the general polynomial of degree n is soluble for all 

n. 

(f) The general quintic polynomial is soluble by radicals. 

(g) The only proper subgroups of S 3 are 1 and A 3 . 

(h) The transcendence degree of Q (t): Q is 1. 

(i) The transcendence degree of Q(f 2 ): Q is 2. 



Chapter 19 

Finite Fields 


Fields that have finitely many elements are important in many branches of mathe¬ 
matics, including number theory, group theory, and projective geometry. They also 
have practical applications, especially to the coding of digital communications, see 
Lidl and Niederreiter (1986), and, especially for the history, Thompson (1983). 

The most familiar examples of such fields are the fields Z p for prime p, but these 
are not all. In this chapter we give a complete classification of all finite fields. It turns 
out that a finite field is uniquely determined up to isomorphism by the number of 
elements that it contains, that this number must be a power of a prime, and that for 
every prime p and integer n > 0 there exists a field with p" elements. All these facts 
were discovered by Galois, though not in this terminology. 


19.1 Structure of Finite Fields 

We begin by proving the second of these three statements. 

Theorem 19.1. IfF is a finite field, then F has characteristic p > 0, and the number 
of elements ofF is p" where n is the degree ofF over its prime subfield. 

Proof. Let P be the prime subfield of F. By Theorem 16.9, P is isomorphic either to 
Q or to Z p for prime p. Since Q is infinite, P = Z p . Therefore F has characteristic 
p. By Theorem 6.1, F is a vector space over P. This vector space has finitely many 
elements, so \F : P] = n is finite. Let x \,... ,x„ be a basis for F over P. Every element 
of F is uniquely expressible in the form 


i -t- T A n x n 

where Ai,...,A„ £ P. Each A ; - may be chosen in p ways since |P| = p , hence there 
are p n such expressions. Therefore |F| = p". 

□ 

Thus there do not exist fields with 6,10,12,14,18,20,... elements. Notice the 
contrast with group theory, where there exist groups of any given order. However, 
there exist non-isomorphic groups with equal orders. To show that this cannot happen 
for finite fields, we recall the Frobenius map. Definition 17.15, which maps x to 
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x p , and is an automorphism when the field is finite by Lemma 17.14. We use the 
Frobenius automorphism to establish a basic uniqueness theorem for finite fields: 

Theorem 19.2. Let p be any prime number and let q = p" where n is any integer 
> 0. Afield F has q elements if and only if it is a splitting field for f(t) = t q — t over 
the prime subfield P = Z p ofF. 

Proof. Suppose that |F| = q. The set F\{0} forms a group under multiplication, of 
order q— 1, so if 0 7 ^ x £ F then x q ~ 1 = 1. Hence x q — x = 0. Since 0 q — 0 = 0, every 
element of F is a zero of t q — t, so fit) splits in F. Since the zeros of / exhaust F, 
they certainly generate it, so F is a splitting field for / over P. 

Conversely, let K be a splitting field for / over Z„. Since Df = — 1, which is 
prime to /, all the zeros of / in K are distinct, so / has exactly q zeros. The set of 
zeros is precisely the set of elements fixed by (j)'\ that is, its fixed field. So the zeros 
form a field, which must therefore be the whole splitting field K. Therefore \K\ = q. 

□ 

Since splitting fields exist and are unique up to isomorphism, we deduce a com¬ 
plete classification of finite fields: 

Theorem 19.3. A finite field has q = p" elements where p is a prime number and n 
is a positive integer. For each such q there exists, up to isomorphism, precisely one 
field with q elements, which can be constructed as a splitting field for t q — t over Z p . 

Definition 19.4. The Galois Field GW(q) is the unique field with q elements. 


19.2 The Multiplicative Group 

The above classification of finite fields, although a useful result in itself, does not 
give any detailed information on their deeper structure. There are many questions 
we might ask—what are the subfields? How many are there? What are the Galois 
groups? We content ourselves with proving one important theorem, which gives the 
structure of the multiplicative group F\{0} of any finite field F. First we need to 
know a little more about abelian groups. 

Definition 19.5. The exponent e(G) of a finite group G is the least common multiple 
of the orders of the elements of G. 

The order of any element of G divides the order |G|, so e(G) divides |G|. In 
general, G need not possess an element of order e(G). For example if G = S 3 then 
e(G) = 6 , but G has no element of order 6 . Abelian groups are better behaved in this 
respect: 


Lemma 19.6. Any finite abelian group G contains an element of order e(G). 
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Proof. Let e = e(G) = p a f .. .p“” where the pj are distinct primes and Otj > 1. The 

definition of e(G ) implies that for each j, the group G must possess an element gj 

cc cc • 

whose order is divisible by p ■ 1 . Then a suitable power a , of g , has order p ■ 1 . Define 

J J J J 

g = a\a 2 .. .a n (19.1) 


Suppose that g m = 1 where m > 1. Then 


a'J = a x m ... a™ x a™ x .. 


.a 


—m 

n 


So if 


q = p i 


a l 


®/'— 1 O7+1 

■■Pj-lPj+1 -P 


CC n 

n 


then a” w/ = 1. But q is prime to the order of aj, so /;J' divides Hence e divides m. 
But clearly g e = 1. Hence g has order e, which is what we want. □ 


Corollary 19.7. If G is a finite abelian group such that e(G) = |G|, then G is cyclic. 


Proof. The element g in (19.1) generates G. 


□ 


We can apply this corollary immediately. 

Theorem 19.8. If G is a finite subgroup of the multiplicative group AT\{0} of a field 
K, then G is cyclic. 

Proof. Since multiplication in K is commutative, G is an abelian group. Let e = e(G). 
For any x £ G we have xf = 1, so that x is a zero of the polynomial t e — 1 over K. By 
Theorem 3.28 (generalised) there are at most e zeros of this polynomial, so |G| < e. 
But e < |G|, hence e = |G|; by Corollary 19.7, G is cyclic. □ 

Corollary 19.9. The multiplicative group of a finite field is cyclic. 


Therefore for any finite field F there is at least one element x such that every 
non-zero element of F is a power of x. We give two examples. 


Examples 19.10. (1) The field GF(ll). The powers of 2, in order, are 


1,2,4,8,5,10,9,7,3,6,1 


so 2 generates the multiplicative group. On the other hand, the powers of 4 are 

1,4,5,9,3,1 


so 4 does not generate the group. 

(2) The field GF(25). This can be constructed as a splitting field for t 2 — 2 over Z 5 , 
since r 2 — 2 is irreducible and of degree 2. We can therefore represent the elements 
of GF(25) in the form a + ba where a 2 = 2. There is no harm in writing a = y/2. 
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By trial and error we are led to consider the element 2 + s/2. Successive powers of 
this are _ _ 

1 2 + s/2 l+4s/2 4s/2 3 + 3s/2 2+4s/2 2 

4 + 2V2 2 + 3V2 3s/2 l+s/2 4 + 3s/2 4 

3+4s/2 4 + s/2 s/2 2 + 2s/2 3 + s/2 3 

l+3s/2 3 + 2V2 2s/2 4 + 4s/2 l+2s/2 1 

Hence 2 + s/2 generates the multiplicative group. 

There is no known procedure for finding a generator other than enlightened trial 
and error. Fortunately the existence of a generator is usually sufficient information. 


19.3 Application to Solitaire 
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FIGURE 25: The solitaire board 

Finite fields have an unexpected application to the recreational pastime of soli¬ 
taire fde Bruijn 1972). Solitaire is played on a board with holes arranged like Fig¬ 
ure 25. A peg is placed in each hole, except the centre one, and play proceeds by 
jumping any peg horizontally or vertically over an adjacent peg into an empty hole; 
the peg that is jumped over is removed. The player’s objective is to remove all pegs 
except one, which—traditionally—is the peg that occupies the central hole. Can it 
be another hole? Experiment shows that it can, but suggests that the final peg cannot 
occupy any hole. Which holes are possible? 

De Bruijn’s idea is to use the field GF(4), whose addition and multiplication 
tables are given in Exercise 16.6, in terms of elements 0,1,0 !,j 8. Consider the holes 
as a subset of the integer lattice Z 2 , with the origin (0,0) at the centre and the axes 
horizontal and vertical as usual. If A is a set of pegs, define 

A(x)= £ a x+y B(X) = £ a x - y 

(x,y)eX (x,y)eX 



Exercises 


225 


Observe that if a legal move changes X to Y , then A(Y) = A(X),B(Y) = B(X). This 
follows easily from the equation a 2 + a + 1 = 0 , which in turn follows from the 
tables. Thus the pair (. A(X),B(X )) is invariant under any sequence of legal moves. 

The starting position X has A(X) = B(X) = 1. Therefore any position Y that 
arises during the game must satisfy A(Y) = B(Y) - 1. If the game ends with a 
single peg on (x,y) then a x+y = a' y = 1. Now a 3 = 1, so a has order 3 in the 
multiplicative group of nonzero elements of GF(4). Therefore x+y,x — y are mul¬ 
tiples of 3, so x,y are multiples of 3. Thus the only possible end positions are 
(—3,0), (0,-3), (0,0), (0,3), (3,0). Experiment (by symmetry, only (0,0), the tra¬ 
ditional finish, and (3,0) need be attempted; moreover, the same penultimate move 
must lead to both, depending on which peg is moved) shows that all five of these 
positions can be obtained by a series of legal moves. 


EXERCISES 

19.1 For which of the following values of n does there exist a field with n elements? 

1, 2, 3, 4, 5, 6 , 17, 24, 312, 65536, 

65537, 83521, 103823, 2 13466917 - 1 

(Hint: See ‘Mersenne primes’ under ‘Internet’ in the References.) 

19.2 Construct fields having 8 , 9, and 16 elements. 

19.3 Let (j) be the Frobenius automorphism of GF(/?"). Find the smallest value of 
m > 0 such that (f> m is the identity map. 

19.4 Show that the subfields of GF(p") are isomorphic to GF(p r ) where r divides 
n, and there exists a unique subfield for each such r. 

19.5 Show that the Galois group of GF(p") : GF(p) is cyclic of order n, gener¬ 
ated by the Frobenius automorphism (j). Show that for finite fields the Galois 
correspondence is a bijection, and find the Galois groups of 

GF (p n ): GF (/?'") 


whenever m divides n. 

19.6 Are there any composite numbers r that divide all the binomial coefficients (') 
for 1 < s < r — 1 ? 

19.7 Find generators for the multiplicative groups of GF(p") when p" = 8 , 9, 13, 
17,19, 23,29, 31, 37, 41, and 49. 

19.8 Show that the additive group of GF(p”) is a direct product of n cyclic groups 
of order p. 
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19.9 By considering the field Z 2 (t), show that the Frobenius monomorphism is not 
always an automorphism. 

19.10* For which values of n does S n contain an element of order e(S„)? 

(Hint: Use the cycle decomposition to estimate the maximum order of an el¬ 
ement of Sand compare this with an estimate of e(§„). You may need esti¬ 
mates on the size of the nth prime: for example, ‘Bertrand’s Postulate’, which 
states that the interval [n,2n] contains a prime for any integer n > 1.) 

19.11* Prove that in a finite field every element is a sum of two squares. 

19.12 Mark the following true or false. 

(a) There is a finite field with 124 elements. 

(b) There is a finite field with 125 elements. 

(c) There is a finite field with 126 elements. 

fd) There is a finite field with 127 elements. 

(e) There is a finite field with 128 elements. 

(f) The multiplicative group of GF(19) contains an element of order 3. 

(g) GF(2401) has a subfield isomorphic to GF(49). 

(h) Any monomorphism from a finite field to itself is an automorphism. 

(i) The additive group of a finite field is cyclic. 



Chapter 20 

Regular Polygons 


We return with more sophisticated weapons to the time-honoured problem of ruler- 
and-compass consttuction, introduced in Chapter 7. We consider the following ques¬ 
tion: for which values of n can the regular n-sided polygon be consttucted by ruler 
and compass? 

The ancient Greeks knew of constructions for 3-, 5-, and 15-gons; they also knew 
how to consUuct a 2n-gon given an n-gon, by the obvious method of bisecting the 
angles. We describe these constructions in Section 20.1. For about two thousand 
years little progress was made beyond the Greeks. If you answered Exercises 7.16 
or 7.17 you got further than they did. It seemed ‘obvious’ that the Greeks had found 
all the constructible regular polygons ... Then, on 30 March 1796, Gauss made the 
remarkable discovery that the regular 17-gon can be constructed (Figure 26). He was 
nineteen years old at the time. So pleased was he with this discovery that he resolved 
to dedicate the rest of his life to mathematics, having until then been unable to decide 
between that and the study of languages. In his Disquisition.es Arithmeticae, reprinted 
as Gauss (1966), he stated necessary and sufficient conditions for consttuctibility 
of the regular n-gon, and proved their sufficiency; he claimed to have a proof of 
necessity although he never published it. Doubtless he did: Gauss knew a proof when 
he saw one. 


20.1 What Euclid Knew 

Euclid’s Elements gets down to business straight away. The first regular polygon 
constructed there is the equilateral triangle, in Book 1 Proposition 1. Figure 27 (left) 
makes the construction fairly clear. 

The square also makes its appearance in Book 1: 

Proposition 46 (Euclid) On a given straight line to describe a square. 

In the proof, which we give in detail to illustrate Euclid’s style, notation such as 
[1,31] refers to Proposition 31 of Book 1 of the Elements. The proof is taken from 
Heath (1956), the classic edition of Euclid’s Elements. Refer to Figure 27 (right) for 
the lettering. 

Proof. Let AB be the given straight line; thus it is required to describe a square on 
the straight line AB. 


227 



228 


Regular Polygons 


H Q 0 


/ ' . V' 

' Jiliilfif Juilntz uuti A je(h^ L . 'rriU i,' 

ou UAisdUn n n 

{((Jd ' uUqnx pa .ilfc) .Kjrf x> ftnnir 

tJTum t i(Tfu r>- m <TVn tt K r.r*. r • *»»<*; J 

ha t mcr^Ti ./^ f *- a V^/< r« 

«uitTiTir 

,$ JO 

'J aTrnu.Lx, fj xy (Mint lu*A ii<it</ f|u w fu.\ipAt 
t~nt Ju. f*m<ujhpriir^n t- ‘f I i ^ c-ro n 

i'—rf 

* t £jri . /2 ^ 7 ^ 

A ^■mpunratio fum-ng, ?<i t ^u<rM.m <Ti) 

-m* i^jurM *wn tav^V ui/ + ItvO 

■^ r ' (jeU'W 

■ %*ot\ iuiu>u<t hi'ijiklihi wWd i. /n««> 

/7^a _ < '%x. 14- (><1$ 

ftufiA 

•|<i(i l/o.J *« San/Su-C A, a Cut ' 


tfrtnfrt’** ~£* writ/ /_•* + g_.<j4 
•mi**.*^ T+-i. 


' j 

€ *t /r-t, '-%AP+. £ ~;t 


l+l_ 

/+4 


r+n ^-Ur $ 


J*{“ 


7+ o_ 

/ -*-/£_ 

/ -4- ?_k 


FIGURE 26: The first entry in Gauss’s notebook records his discovery that the regular 
17-gon can be constructed. 


Let AC be drawn at right angles to the straight line AB from the point A on it [ 1, 
11], and let AD be made equal to AB; 
through the point D let DE be drawn parallel to AB, 
and through the point B let BE be drawn parallel to AD. [1,31] 

Therefore ADEB is a parallelogram; 

therefore AB is equal to DE, and AD to BE. [1, 34] 

But AB is equal to AD; 

therefore the four straight lines BA, AD, DE, EB are equal to one another; 
therefore the parallelogram ADEB is equilateral. 

I say next that it is also right-angled. 
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FIGURE 27: Left. Euclid’s construction of an equilateral triangle. Right: Euclid’s 
construction of a square. 


For, since the straight line AD falls upon the parallels AB, DE, 

the angles BAD, ADE are equal to two right angles. [1, 29] 

But the angle BAD is also right; 

therefore the angle ADE is also right. 

And in parallelogrammic areas the opposite sides and angles are equal to one 
another; [1, 34] 

therefore each of the opposite angles ABE, BED is also right. 

Therefore ADEB is right-angled. 

And it was also proved equilateral. 

Therefore it is a square; and it is described on the straight line AB. 

Q.E.F. 

□ 

Here Q.E.F. (quod erat faciendum—that which was to be done) replaces the fa¬ 
miliar Q.E.D. (quod erat demonstrandum—that which was to be proved) because 
this is not a theorem but a construction. In any case, the Latin phrase occurs in later 
translations: Euclid wrote in Greek. Now imagine you are a Victorian schoolboy— 
it always was a schoolboy in those days—trying to learn Euclid’s proof by heart, 
including the exact choice of letters in the diagrams... 

The construction of the regular pentagon has to wait until Book 4 Proposition 
11, because it depends on some quite sophisticated ideas, notably Proposition 10 of 
Book 4: To construct an isosceles triangle having each of the angles at the base 
double of the remaining one. In modern terms: construct a triangle with angles 
2n/5,2n/5,n/5. Euclid’s method for doing this is shown in Figure 28. Given AB, 
find C so that ABxBC = CA 2 . To do that, see Book 2 Proposition 11, which is itself 
quite complicated—the construction here is essentially the famous ‘golden section’, 
a name that seems to have been introduced in 1835 by Martin Ohm (Herz-Fischler 
1998, Livio 2002). Euclid’s method is given in Exercise 19.10. Next, draw the circle 
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centre A radius AB, and find D such that BD = AC. Then triangle ABD is the one 
required. 



FIGURE 28: Left. Euclid’s construction of an isosceles triangle with base angles 
An/5. Right: Euclid’s construction of a regular pentagon. Make ACD similar to tri¬ 
angle ABD in the left-hand Figure and proceed from there. 


With this shape of triangle under his belt, Euclid then constructs the regular pen¬ 
tagon: Figure 28 (right) shows his method. 

The hexagon occurs in Book 4 Proposition 15, the 15-gon in Book 4 Proposition 
16. Bisection of any angle. Book 1 Proposition 9, effectively completes the Euclidean 
catalogue of constructible regular polygons. 


20.2 Which Constructions are Possible? 

That, however, was not the end of the story. 

We derived necessary and sufficient conditions for the existence of a ruler-and- 
compass construction in Theorem 7.11. We restate it here for convenience as: 

Theorem 20.1. Suppose that K is a subfield of C, generated by points in a subset 
PCC. Let a lie in an extension L ofK such that there exists a finite series of subfields 

K = K 0 CK l C---CK r = L 

such that [Kj+\ : Kj] = 2 for j = 0,... ,r — 1. Then the point a £ C is constructible 
from P. The converse is also valid. 

There is a more useful, but weaker, version of Theorem 20.1. To prove it, we first 
need: 

Lemma 20.2. If G is a finite group and |G| = 2 r for r > 1, then its centre Z(G) 
contains an element of order 2. 
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Proof. Use the class equation (14.2). We have 


1 + C 2 + • • • + Cjt — 2' 


so some Cj is odd. By Corollary 14.12 this Cj also divides 2'\ so we must have 
\Cj\ = 1. Hence Z(G) ^ 1. Now apply Lemma 14.14. □ 

Corollary 20.3. If G is a finite group and |G| = 2 r then there exists a series 


1 = G 0 C Gi C • • • C G r = G 


of normal subgroups ofG, such that \Gj\ = 2 J for 0 <j<r. 

Proof Use Lemma 20.2 and induction. □ 

Now we can state and prove the promised modification of Theorem 20.1. 

Proposition 20.4. IfK is a subfield ofC, generated by points in a subset P CC, and 
if a lies in a normal extension L of K such that [L : K] = 2’ for some integer r, then 
cc is constructible from P. 

Proof L.K is separable since the characteristic is zero. Let G be the Galois group 
of L.K. By Theorem 12.2(1) |G| = 2 r . By Corollary 20.3, G has a series of normal 
subgroups 

1 = G 0 C Gi C • • • C G r = G 

such that |Gy | = 2 f Let Kj be the fixed field G]._j. By Theorem 12.2(3) [Kj+\ '■ Kj\ = 
2 for all j. By Theorem 20.1, CC is constructible from P. □ 


20.3 Regular Polygons 

We shall use a mixture of algebraic and geometric ideas to find those values of n 
for which the regular n- gon is constructible. To save breath, let us make the following 
(non-standard): 

Definition 20.5. The positive integer n is constructive if the regular n-gon is con¬ 
structible by ruler and compasses. 

The first step is to reduce the problem to prime-power values of n. 

Lemma 20.6. If n is constructive and m divides n, then m is constructive. If In and n 
are coprime and constructive, then mn is constructive. 

Proof. If m divides n, then we can construct a regular m-gon by joining every z/th 
vertex of a regular n- gon, where d = n/m. 
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If m and n are coprime, then there exist integers a,b such that am + bn = 1. 
Therefore 

1 1 , 1 

— = a-+b— 
mn n m 

Hence from angles 2 n/m and 2n/n we can construct 2k/ mn, and from this we obtain 
a regular mn- gon. □ 

Corollary 20.7. Suppose that n = p"/ 1 ... where p\,...,p r are distinct primes. 
Then n is constructive if and only if each p"' is constructive. 

Another obvious result: 

Lemma 20.8. For any positive integer m, the number 2 m is constructive. 

Proof. Any angle can be bisected by ruler and compasses, and the result follows by 
induction on m. □ 

This reduces the problem of constructing regular polygons to the case when the 
number of sides is an odd prime power. Now we bring in the algebra. In the complex 
plane, the set of nth roots of unity forms the vertices of a regular n-gon. Further, as 
we have seen repeatedly, these roots of unity are the zeros in C of the polynomial 

f"-l = (f-l)(f"- 1 +f "- 2 + ---+f + l) 

We concentrate on the second factor on the right-hand side: f(t) = t"~ l +t n ~ 2 H-b 

t + 1. Its zeros are the powers i/ k for 1 <k<n—\ of a primitive nth root of unity 

£ __ g 2 Ki/n 

Lemma 20.9. Let p be a prime such that p" is constructive. Let ^ be a primitive 
p n th root of unity in C. Then the degree of the minimal polynomial of f over Q is a 
power of 2. 

Proof. Take £ = e 27ri /p". The number p" is constructive if and only if we can con¬ 
struct £ from Q. Hence by Theorem 7.12 [Q(£) : Q] is a power of 2. Hence the 
degree of the minimal polynomial of £ over Q is a power of 2. □ 

The next step is to calculate the relevant minimal polynomials to find their de¬ 
grees. It turns out to be sufficient to consider p and p 2 only. 

Lemma 20.10. If p is a prime and £ is a primitive pth root of unity in C, then the 
minimal polynomial of £ over Q is 

f(t) = \+ t + ...+t p - X 

Proof. This polynomial is irreducible over Q by Lemma 3.22. Clearly £ is a zero. 
Therefore it is the minimal polynomial of £. □ 

To prove the case p 2 , we apply the method of Lemma 3.22. 
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Lemma 20.11. If p is a prime and £ is a primitive p 2 th root of unity in C, then the 
minimal polynomial of £ over Q is 

g (t) = \+ t P + ---+trtP- V > 


Proof Note that g(t) = ( t pl — 1 )/(t p — 1). Now £ p2 — 1=0 but £ p — 1 0 so 

g{Cf) = 0. It suffices to show that g(t) is irreducible over Q. As before we make 
the substitution t = 1 + u. Then 


g( l+«) 


(1 + u) p2 — 1 
(1 + u)P — 1 


and modulo p this is 


( 1 + M p2 )-1 

( 1 + M p )-1 


= M p ( p “ 1 ) 


Therefore g(l +u) = u p ^ p T + pk(u) where k is a polynomial in u over Z. From the 
alternative expression 


g(l +M ) = l + (l +M ) p + ... + (l +M ) p ( p - 1 ) 


it follows that k has constant term 1. By Eisenstein’s Criterion, g( 1 + u) is irreducible 
over Q. □ 

We can now obtain a more specific result than Lemma 15.4 for pth roots of unity 
over Q: 

Theorem 20.12. Let p be prime and let ^ be a primitive pth root of unity in C. Then 
the Galois group of Q(£) • Q is cyclic of order p— 1. 

Proof This follows the same lines as the proof of Lemma 15.4, but now we can say 
a little more. 

The zeros in C of t p — 1 are C,f where 0<j< p 1 , and these are distinct. 
These zeros form a group under multiplication, and this group is cyclic, generated by 
£. Therefore any Q-automorphism of Q(£) is determined by its effect on £. Lurther, 
Q-automorphisms permute the zeros of t p — 1 . Hence any Q-automorphism of Q(C) 
has the form 

aj : C ^ C j 

and is uniquely determined by this condition. 

We claim that every a ; - is, in fact, a Q-automorphism of Q(C)- The with 

j > 0 are the zeros of 1 +H-b f p . This polynomial is irreducible over Q by 

Lemma 3.22. Therefore it is the minimal polynomial of any of its zeros, namely 
where 1 <j<P~ T By Proposition 11.4, every a ; - is a Q-automorphism of Q(C)> 
as claimed. 

Clearly a,a, = a ;/ , where the product ij is taken modulo p. Therefore the Galois 
group of Q(£) : Q is isomorphic to the multiplicative group Z*. This is cyclic by 
Corollary 19.9. □ 
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We now come to the main result of this chapter. 


Theorem 20.13 (Gauss). The regular n-gon is constructible by ruler and compasses 
if and only if 

n = 2 r pi. ,.p s 

where r and s are integers > 0 , and p i,... ,p s are distinct odd primes of the form 

Pj = 2 2 °' + l 

for positive integers rj. 

Proof Let n be constructive. Then n = 2 r p" l n .. ,p™ s where pi,...,p s are distinct odd 
primes. By Corollary 20.7, each py' is constructive. If nij > 2 then pj is constructive 
by Theorem 20.1. Hence the degree of the minimal polynomial of a primitive pj th 
root of unity over Q is a power of 2 by Lemma 20.9. By Lemma 20.11, Pjipj — 1) 
is a power of 2, which cannot happen since pj is odd. Therefore mj = 1 for all j. 
Therefore pj is constructive. By Lemma 3.22 

Pj- l=2 s t 

for suitable sj. Suppose that sj has an odd divisor a > 1, so that Sj = ab. Then 

pj=( 2 & r+i 


which is divisible by 2 b + 1 since 

f« + l = (f + l)(f «- 1 -/'- 2 + ... + l) 
when a is odd. So pj cannot be prime. Hence Sj has no odd factors, so 

Sj = 2 r * 


for some r ; > 0 . 

This establishes the necessity of the given form of n. Now we prove sufficiency. 
By Corollary 20.7 we need consider only prime-power factors of n. By Lemma 20.8, 
2 r is constructive. We must show that each pj is constructive. Let £ be a primitive 
Pj\h root of unity. By Theorem 20.12 

[Q(C):Q]=p ; -l= 2 s t 

Now Q(£) is a splitting field for /(f) = 1 4 -hf ^" 1 over Q, so that Q(C) : Q 

is normal. It is also separable since the characteristic is zero. By Lemma 15.5, the 
Galois group T(Q(£) : Q) is abelian, and by Theorem 20.12 or an appeal to the 
Galois correspondence it has order 2 S J. By Proposition 20.4, £ £ C is constructible. 

□ 
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20.4 Fermat Numbers 

The problem of finding all constructible regular polygons now reduces to number 
theory, and there the question has a longer history. In 1640 Pierre de Fermat won¬ 
dered when 2 * + 1 is prime, and proved that a necessary condition is for k to he a 
power of 2. Thus we are led to: 

Definition 20.14. The nth Fermat number is F„ = 2 2 " + 1. 

The question becomes: when is F„ prime? 

Fermat noticed that Fq = 3. 1) = 5. f 2 = 17,F 3 = 257, and F 4 = 65537 are all 
prime. He conjectured that F n is prime for all n, but this was disproved by Euler in 
1732, who proved that F$ is divisible by 641 (Exercise 20.5). Knowledge of factors of 
Fermat numbers is changing almost daily, thanks to the prevalence of fast computers 
and special algorithms for primality testing of Fermat numbers: see References under 
‘Internet’. At the time of writing, the largest known composite Fermat number was 
F 3329780 , with a factor 193.2 3329782 + 1. This was proved by Raymond Ottusch in July 
2014 as a contribution to PrimeGrid’s Proth Prime Search. At that time, 277 Fermat 
numbers were known to be composite. 

No new Fermat primes have been found, so the only known Fermat primes are 
still those found by Fermat himself: 2, 3, 5, 17, 257, and 65537. We sum up the 
current state of knowledge as: 

Proposition 20.15. If p is a prime, then the regular p-gon is constructible for p = 
2,3,5,17,257,65537. 


20.5 How to Draw a Regular 17-gon 

Many constructions for the regular 17-gon have been devised, the earliest pub¬ 
lished being that of Huguenin (see Klein 1913) in 1803. For several of these con¬ 
structions there are proofs of their correctness which use only synthetic geometry 
(ordinary Euclidean geometry without coordinates). A series of papers giving a con¬ 
struction for the regular 257-gon was published by F.J. Richelot (1832), under one 
of the longest titles I have ever seen. Bell (1965) tells of an over-zealous research 
student being sent away to find a construction for the 65537-gon, and reappearing 
with one twenty years later. This story, though apocryphal, is not far from the truth; 
Professor Hermes of Fingen spent ten years on the problem, and his manuscripts are 
still preserved at Gottingen. 

One way to construct a regular 17-gon is to follow faithfully the above theory, 
which in fact provides a perfectly definite construction after a little extra calcula¬ 
tion. With ingenuity it is possible to shorten the work. The construction that we now 
describe is taken from Hardy and Wright (1962). 
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Our immediate object is to find radical expressions for the zeros of the polyno¬ 
mial 

t n _ i 

-- = t 16 + •••+* + 1 ( 20 . 1 ) 

t- 1 

over C. We know the zeros are where £ = er m / 11 and 1 < k < 16. To simplify 
notation, let 

6 =2n/\l 

so that C, k = cos kO + isinkd. 

Theorem 20.12 for n = 17 implies that the Galois group r(Q(£) : Q) consists of 
the Q-automorphisms Jj defined by 

rj(Q = V i <;<16 

and this is isomorphic to the multiplicative group Zf 7 . By Theorem 19.8 Zj 7 is cyclic 
of order 16. 

Galois theory now implies that £ is constructible. In fact, there must exist a gen¬ 
erator a for Zf 7 . Then a 16 = 1 and no smaller power of a is 1. Consider the series 
of subgroups 

1 = (a 16 ) o (a 8 ) < (a 4 ) < (a 2 ) < (a) = Zj 7 (20.2) 

The Galois correspondence leads to a tower of subfields from Q to Q(£) in which 
each step is an extension of degree 2. By Theorem 7.11, £ is constructible, so the 
regular 17-gon is constructible. 

To convert this to an explicit construction we must find a generator for Z| 7 . Ex¬ 
perimenting with small values, a = 2 is not a generator (it has order 8 ), but a = 3 is 
a generator. In fact, the powers of 3 modulo 17 are: 


m 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

3 m 

1 

3 

9 

10 

13 

5 

15 

11 

16 

14 

8 

7 

4 

12 

2 

6 


Motivated by (20.2), define 

xi = c + C 9 + C 13 + C 15 + C 16 + C 8 + C 4 + C 2 
* 2 = C 3 + C 10 + C 5 + ?h + C14 + C7 + C I 2 + C 6 
yi =c+c 13 +c 16 +c 4 

y 2 = C 9 + C 15 + C 8 + C 2 
B = C 3 + C 5 + C 14 + C 12 
y 4 = C 10 + C n + C 7 + C 6 


By definition, these lie in various fixed fields in the aforementioned tower. Now 

f*+ £!?-* = 2 cos*e (20.3) 
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for k = l, ti .., 16, so 

X] = 2(cos0 + cos80 + cos40 + cos20) 
X 2 = 2(cos30 + cos 70 + cos 50 + cos 60) 
}’i = 2(cos0 +cos40) 
y 2 = 2(cos80 +cos 20 ) 
y 3 = 2(cos30 +cos50) 
y <4 = 2(cos70 + cos60) 


Equation (20.1) implies that 
Now (20.4) and the identity 


X\ +X2 = — 1 


2 cos md cos nO = cos(m + n )0 +cos(w — n)Q 


imply that 


XlX2 = 4(xi +X2) = -4 


using (20.3). Hence x\ and X 2 are zeros of the quadratic polynomial 


t 2 +t -4 


Further, x\ > 0 so that x\ > X 2 - By further trigonometric expansions, 

yi+yi=x\ yiy 2 = -i 


and vi ,>’2 are the zeros of 

t 2 — x\t — 1 

Further, y\ > y 2 ■ Similarly, V 3 and 34 are the zeros of 

t 2 — X2t — 1 


and y 3 > y 4 . Now 


2cos0+2cos40 = y\ 
4cos 0 cos40 = 2cos 50 +2 cos 30 = 3^3 


so 

are the zeros of 


Zi=2cos0 Z2=2cos40 
t 2 -yit+y 3 


(20.4) 


(20.5) 


( 20 . 6 ) 

(20.7) 


( 20 . 8 ) 


and zi > Z 2 - 
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Solving the series of quadratics (20.5-20.8) and using the inequalities to decide 
which zero is which, we obtain 

cos 0 = 7-7 ( — 1 + V^17 -|- \/34 — 2\/T7 (20.9) 

16 \ 

+ ^68 + 12717—16^34 + 2^- 2(1 - %/l7)^34-2^^) 

where the square roots are the positive ones. 

From this we can deduce a geometric construction for the 17-gon by constructing 
the relevant square roots. This procedure is animated in an iPad app, Stewart (2014), 
and can also be found on the web. By using greater ingenuity it is possible to obtain 
an aesthetically more satisfying construction. The following method (Figure 29) is 
due to Richmond (1893). 

Let 0 be the smallest positive acute angle such that tan40 = 4. Then 0 , 20 . and 
40 are all acute. Expression (20.5) can be written 

r +4t cot40 —4 


whose zeros are 

2 tan 20 

Hence 

x\ = 2 tan 20 

This implies that 

yi = tan ^0 + ^j y 2 = tan ^0 


— 2 cot 20 
x 2 = — 2 cot 20 

7 ) y 3 =tan 0 y 4 = -cot 0 


so that 


2(cos30+cos50) = tan0 


4cos30cos50 = tan 



In Figure 29, let OA, OB be two perpendicular radii of a circle. Make OI = 4 OB and 
ZOIE = |ZOIA. Find F on AO produced to make ZEIF = |. Let the circle on AF 
as diameter cut OB in K, and let the circle centre E through K cut OA in N 3 and N 5 
as shown. Draw N 3 P 3 and N 5 P 5 perpendicular to OA. Then ZOIA = 40 and ZOIE 
= 0. Also 


2(cosZAOP 3 +COSZAOP 5 ) 


o ON 3 — ON 5 
OA 

OE OE 
4- 1 -= 

OA OI 


tan 0 
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and 


4 cos ZAOP 3 cos ZAOP 5 = —4 


= -4- 


ON 3 x ON 5 
OAxOA 
.OK 2 


= -4 


OA- 

OF 

OA 


OF / K 
= ~ 01 = ,an o — 4 

Comparing these with equation (17.8) we see that 

ZAOP 3 = 30 ZAOP s =5e 


Hence A, P 3 , P 5 are the zeroth, third, and fifth vertices of a regular 17-gon inscribed 
in the given circle. The other vertices are now easily found. 



N,F O E N, 


FIGURE 29: Richmond’s construction for a regular 17-gon. 


In Chapter 21 we return to topics associated with regular polygons, especially so- 
called cyclotomic polynomials. We end that chapter by investigating the construction 
of regular polygons when an angle-trisector is permitted, as well as the traditional 
ruler and compass. 


EXERCISES 


20.1 Using only the operations ‘ruler’ and ‘compass’, show how to draw a parallel 
to a given line through a given point. 
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20.2 Verify the following approximate constructions for regular n-gons found by 
Oldroyd (1955): 

(a) 1-gon. Construct cos -1 44 ^ giving an angle of approximately 2%/l. 

(b) 9 -gon. Construct cos " 1 5v ^ ) " 1 . 

(c) 11 -gon. Construct cos " 1 * and cos " 1 J, and take their difference. 

(d) 13 -gon. Construct tan " 1 1 and tan " 1 Hrn anc * ta ^ e t ^ eir difference. 

20.3 Show that for n odd the only known constructible n-gons are precisely those 
for which n is a divisor of 2 32 — 1 = 4294967295. 

20.4 Work out the approximate size of 7*382449. which is known to be composite. 
Explain why it is no easy task to find factors of Fermat numbers. 

20.5 Use the equations 

641 =5 4 + 2 4 = 5.2 7 + l 
to show that 641 divides F$. 

20.6 Show that 

F n +i = 2 + F n F n -i ...Fo 

and deduce that if m f n then F m and F„ are coprime. Hence show that there 
are infinitely many prime numbers. 

20.7 List the values of n < 100 for which the regular n- gon can be constructed by 
ruler and compasses. 

20.8 Verify the following construction for the regular pentagon. 

Draw a circle centre O with two perpendicular radii OPo, OB. Let D be the 
midpoint of OB, join PoD. Bisect ZODP () cutting OPo at N. Draw NP| per¬ 
pendicular to OPo cutting the circle at Pj. Then Po and Pi are the zeroth and 
first vertices of a regular pentagon inscribed in the circle. 

20.9 Euclid’s construction for an isosceles triangle with angles 4zr/5,4zr/5,27r/5 
depends on constructing the so-called golden section: that is. To construct a 
given straight line so that the rectangle contained by the whole and one of 
the segments is equal to the square on the other segment. The Greek term was 
‘extreme and mean ratio’. In Book 2 Proposition 11 of the Elements Euclid 
solves this problem as in Figure 30. 

Let AB be the given line. Make ABDC a square. Bisect AC at E, and make EF 
= BE. Now find H such that AH = AF. Then the square on AH has the same 
area as the rectangle with sides AB and BH, as required. 

Prove that Euclid was right. 

20.10 Mark the following true or false. 
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Fi— G 



VJ 

H 




C K D 

FIGURE 30: Cutting a line in extreme and mean ratio. 


(a) 2" + 1 cannot be prime unless n is a power of 2. 

(b) If n is a power of 2 then 2” + 1 is always prime. 

(c) The regular 771-gon is constructible using ruler and compasses. 

fd) The regular 768-gon is constructible using ruler and compasses. 

(e) The regular 51-gon is constructible using ruler and compasses. 

(f) The regular 25-gon is constructible using ruler and compasses. 

(g) For an odd prime p, the regular p 2 -gon is never constructible using ruler 
and compasses. 

(h) If n is an integer > 0 then a line of length yTz can always be constructed 
from Q using ruler and compass. 

(i) If n is an integer > 0 then a line of length \fn can always be constructed 
from Q using ruler and compass. 

(j) A point whose coordinates lie in a normal extension of Q whose degree 
is a power of 2 is constructible using ruler and compasses. 

2 

(k) If p is a prime, then t p — 1 is irreducible over Q. 
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Chapter 21 


Circle Division 


To halt the story of regular polygons at the stage of ruler-and-compass constructions 
would leave a small but significant gap in our understanding of the solution of poly¬ 
nomial equations by radicals. Our definition of ‘radical extension’ involves a slight 
cheat, which becomes evident if we ask what the expression of a root of unity looks 
like. Specifically, what does the radical expression of the primitive 11th root of unity 


„ 2n . 2n 

Ci i = cos-h ism — 

S 11 11 

look like? 

As the theory stands, the best we can offer is 


v'T 


( 21 . 1 ) 


which is not terribly satisfactory, because the obvious interpretation of \/T is 1, not 
Ci i. Gauss’s theory of the 17-gon hints that there might be a more impressive answer. 
In place of \/T Gauss has a marvellously complicated system of nested square roots, 
which we repeat from equation (20.9): 


cos -j-y = tt( ~ 1 + v / 17 + ^34 - 2 a/17 


+ 68 + 12 ^- 16^34 + 2 ^ 17 - 2(1 - \fl7) ^ 34 - 2 ^^ 

with a similar expression for sin jS, and hence an even more impressive formula for 

C 17 =cos^+tsin^y. 

Can something similar be done for the 11th root of unity? For all roots of unity? 
The answer to both questions is ‘yes’, and we are getting the history back to front, 
because Gauss gave that answer as part of his work on the 17-gon. Indeed, Vander¬ 
monde came very close to the same answer 25 years earlier, in 1771, and in particular 
he managed to find an expression by radicals for Cn that is less disappointing than 
(21.1). He, in turn, built on the epic investigations of Lagrange. 

The technical term for this area is ‘cyclotomy’, from the Greek for ‘circle cut¬ 
ting’. In particular, pursuing Gauss’s and Vandermonde’s line of enquiry will lead 
us to some fascinating properties of the ‘cyclotomic polynomial’ <4>j(f), which is the 
minimal polynomial over Q of a primitive d/th root of unity in C. 
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21.1 Genuine Radicals 

Of course, we can ‘solve’ the entire problem at a stroke if we define \/l to be the 
primitive nth root of unity 

2n . 2n 

cos- b i sin — 

n n 

instead of defining it to be 1. In a sense, this is what Definition 15.1 does. However, 
there is a better solution, as we shall see. What makes the above interpretation of \/T 
unsatisfactory? Consider the typical case of £17 = v 7 1. The minimal polynomial of 
£17 is not t 17 — 1 , as the notation sf \ suggests; instead, it has degree 16, being equal 
to 

f 16 +f 15 H-f + 1 

It would be reasonable to seek to determine the zeros of this 16th degree equation us¬ 
ing radicals of degree 16 or less, but a 17th root seems rather out of place. Especially 
since we know from Gauss that in this case (nested) square roots are enough. 

However, that is a rather special example. What about other nth roots of unity? 
Can they also be expressed by what we might informally call ‘genuine’ radicals, 
those not employing the %/! trick? (We pin down this concept formally in Defini¬ 
tion 21.1.) Classically, the answer was found to be ‘yes’ for 2 < n < 10, as we now 
indicate. 

When n = 2, the primitive square root of unity is — 1. This lies in Q, so no radicals 
are needed. 

When n = 3, the primitive cube roots of unity are solutions of the quadratic 
equation 

t 2 + t +1=0 

and so are of the form ft), or where 


ft) = - 


1 

2 



involving only a square root. 

When n = 4, a primitive 4th root of unity is i, which again can be represented 
using only a square root, since i = \/ - 1 . 

When n = 5, we have to solve 


t 4 +t 3 +t 2 +t+ 1 =0 


( 21 . 2 ) 


We know from Chapter 18 that any quartic can be solved by radicals; indeed only 
square and cube roots are required (in part because = \J\fx). But we can do 
better. There is a standard trick that applies to equations of even degree that are 
palindromic —the list of coefficients is symmetric about the central term. We en¬ 
countered this trick in Exercises 15.4 and 15.5: express the equations in terms of a 
new variable 


1 

li — t - 1 — 
t 


(21.3) 
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so 


Explicitly, we get four zeros: 



t 


- 1 ± V 5 ± \/-10 ± 2^5 


(21.4) 


with independent choices of the first two ± signs, and the third equalling the first. So 
we can express primitive 5th roots of unity using nothing worse than square roots. 

Continuing in this way, we can find a radical expression for a primitive 6 th root 
of unity (it is —CO); a primitive 7th root of unity (use the t + 1/f trick to reduce to 
a cubic); a primitive 8 th root of unity (\/i is one possibility, is perhaps better); 

a primitive 9th root of unity and a primitive 10th root of unity (—£ 5 ). The 

first case that baffled mathematicians prior to 1771 was therefore the primitive 11th 
root of unity, which leads to a quintic if we try the t - 1 /t trick. But in that year, 
Vandermonde obtained the explicit radical expression 
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He stated that his method would work for any primitive nth root of unity, but he 
did not give a proof. That was supplied by Gauss in 1796, with a gap in the proof, 
see below, and it was published in 1801 in his Disquisition.es Arithmeticae. It is not 
known whether Gauss was aware of Vandermonde’s pioneering work. 


21.2 Fifth Roots Revisited 

Before proving a version of Gauss’s theorem on the representability of roots 
of unity by genuine radicals, it helps to have an example. We can explain Vander¬ 
monde’s approach in the simpler case n = 5, where explicit calculations are not too 
lengthy. 

As before, we want to solve 

f 4 +f 3 +/ 2 +f+l =0 
by radicals. We know that the zeros are 

c c 2 c 3 c 4 

where £ = cos ^ + /sin =7. The exponents 1,2,3,4 can be considered as elements 
of the multiplicative group of the field Z 5 . By Theorem 20.12 the Galois group of 
Q(£) : Q consists of the Q-automorphisms 

07: C ^ C ; 1 < j < 4 

The Galois group is therefore isomorphic to is Z2j, which is cyclic of order 4 by 
Theorem 19.8. Experiment quickly shows that it is generated by the element 2 (mod 
5). Indeed, modulo 5 the powers of 2 are 

2° = 1 2 1 = 2 2 2 = 4 2 3 = 3 (21.5) 

Hilbert’s Theorem 90, Theorem 18.18, leads us to consider the expression 

ai = C + *C 2 -C 4 -*'C 3 

and compute its fourth power. We find (suppressing some details) that 

« 2 = -(i+20(C-C 2 + C 4 -C 3 ) 

so, squaring again, 

af = -15 + 20/ 

Therefore we can express cq by radicals: 


cq = <7—15 + 20/ 
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We can play a similar game with 


« 3 = C-'C 2 -C 4 + *C 3 


to get 

a 3 = v^—15 — 20/ 

The calculation of aj* also draws attention to 

a 2 = C-C 2 + C 4 -C 3 

and shows that a| = 5, so 

a 2 = a/5 

Summarising: 

«0 = C + C 2 + C 4 + C 3 = -1 

a, = C + *C 2 -C 4 -*C 3 = 15 + 20; 

a 2 = C — C 2 + C 4 — C 3 = V5 

a 3 = C-*C 2 -C 4 + *C 3 = v^—15 —20/ 

Thus we find four linear equations in £, £ 2 , \ C 4 . These equations are independent, 
and we can solve them. In particular, 

c<o + ai + a 2 + a 3 


is equal to 

C(i +1 +1 +1) + C 2 (i + / -1 - /) + C 4 (i -1 +1 -1) + C 3 (i - i -1+0 = 4C 

Therefore 

C= \ 1 — y/5 + \!\/—15 +20; + ^v/—15 — 20/ 

This expression is superficially different from (21.4), but in fact the two are equiva¬ 
lent. Both use nothing worse than square roots. 

This calculation is too remarkable to be mere coincidence. It must work out nicely 
because of some hidden structure. What lies behind it? 

The general idea behind Vandermonde’s calculation, as isolated by Gauss, is the 
following. Recall Definition 21.7, which introduces the group of units Z* of the ring 
Z„. This consists of all elements that have a multiplicative inverse (mod ;;), and it is 
a group under multiplication. When;; is prime, this consists of all nonzero elements. 
In general, it consists of those elements that are prime to 

The multiplicative group is cyclic of order 4, and the number 2 (modulo 5) is a 
generator. It has order 4 in Zj. The complex number ; is a primitive 4th root of unity, 
so ; has order 4 in the multiplicative group of 4th roots of unity, namely 1,;, —1, —;. 
These two facts conspire to make the algebra work. 

To see how, we apply a little Galois theory—a classic case of being wise after the 
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event. By Theorem 21.9, the Galois group T of Q(£) : Q has order 4 and comprises 
the Q-automorphisms generated by the maps 

Pk- C ^ C A 

for k= 1,2,3,4. The group F is isomorphic to Z( by the map i—► k (mod 5). There¬ 

fore P 2 has order 4 in T, hence generates T, and F is cyclic of order 4. 

The extension is normal, since it is a splitting field for an irreducible polynomial, 
and we are working over C so the extension is separable. By the Galois correspon¬ 
dence, any rational function of C that is fixed by p^ is in fact a rational number. 
Consider as a typical case the expression (X\ above. Write this as 

ai = C+te(C)+^ 2 (C)+i^ 3 (0 


Then 

P 2 (« 1 ) = p 2 (C) + *pf (C) + * 2 p| (C) + * 3 C 
since p^(C) = C- Therefore 

p 2 {a l ) = r l a l 

SO 

p 2 (af) = (i _1 a i ) 4 = a\ 

Thus lies in the fixed field of p 2 , that is, the fixed field of T, which is Q... 

Hold it. 

The idea is right, but the argument has a flaw. The explicit calculation shows that 
af = —15 + 20;, which lies in Q(;), not Q. What was the mistake? The problem is 
that a\ is not an element of Q(£). It belongs to the larger field Q(£)(;), which equals 
Q(;, £). So we have to do the Galois theory for Q(;, Q : Q, not Q(£) : Q. 

It is fairly straightforward to do this. Since 4 and 5 are coprime, the product 
E, = is a primitive 20th root of unity. Moreover, E, 5 = i and g 16 = ^. Therefore 
Q(;, £) = Q(£). Since 20 is not prime, we do not know that this group is cyclic, so 
we have to work out its structure. In fact, it is the group of units Z ^ 0 of the ring Z20, 
which is isomorphic to Z 2 x Z 4 , not Zs. By considering the tower of fields 

QCQ(;)CQ© 

and using the structure of Zjq, it can be shown that the Galois group of Q(i®) : Q(;) 
is the subgroup of Z^ 0 isomorphic to Z 4 , generated by the Q(;)-automorphism p 2 
that sends £ to £ 2 and fixes Q (;). We prove a more general result in Theorem 21.3 
below. 

Having made the switch to Q(^), the above calculation shows that af lies in 
the fixed field of the Galois group r(Q(^) : Q(;)). This field is Q(;), because the 
extension is normal and separable. So without doing the explicit calculations, we can 
see in advance that af must lie in Q(;). The same goes for 0 $, (X 3 , and (trivially) 0 !q . 
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21.3 Vandermonde Revisited 

Vandermonde was very competent, but a bit of a plodder; he did not follow up 
his idea in full generality, and thereby missed a major discovery. He could well have 
anticipated Gauss, possibly even Galois, if he had found a proof that his method was 
a completely general way to express roots of unity by genuine radicals, instead of 
just asserting that it was. 

As preparation, we now establish Vandermonde’s main point about the primitive 
11th roots of unity. Any unproved assertions about Galois groups will be dealt with 
in the general case, see Section 21.4. Let £ = £n. Vandermonde started with the 
identity 

C 10 + C 9 + --- + C + i = o 

and played the u = £ +1 /£ trick to reduce the problem to a quintic, but with hindsight 
this step is not necessary and if anything makes the idea more obscure. Introduce a 
primitive 10th root of unity 0, so that 0£ is a primitive 110th root of unity. Consider 
the field extension Q(0£) : Q(0), which turns out to be of degree 10, with a cyclic 
Galois group of order 10 that is isomorphic to Z*j. A generator for Zj, is readily 
found, and turns out to be the number 2, whose successive powers are 

1,2,4,8,5,10,9,7,3,6 

Therefore Y = T(Q(0£) : Q(0)) consists of the Q(0)-automorphisms pk, for k = 
1 ,..., 10, that map 

C £* 0 I—>• 0 

Let / be any integer, 0 < / < 9, and define 

a, = f + e'f 1 + e-'f 4 H-h e 9 'f 6 

= E’_o eJ 'Z ’ 

Consider the effect of p 2 , which sends £ i—»• and fixes 0. We have 

P2(«/) = £ 0- , 'C 2 ' " = O-'a, 
f=o 

SO 

p 2 (a/°) = 0~ lo, a/° = a/ 0 

and a/ 0 lies in the fixed field of T, which is Q(0)). Thus there is some polynomial 
//(0), of degree < 9 over Q, with 

«/ 10 = / z ( 0 ) 

With effort, we can compute //(0) explicitly. Short cuts help. At any rate. 


«/= l VW) 


( 21 . 7 ) 
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We already know how to express 9 by genuine radicals since it is a primitive 10th 
root of unity, so we have expressed a/ by radicals—in fact, only square roots and 
fifth roots are needed, since = \/\f and fifth roots of unity require only square 
roots. 

Finally, the ten equations (21.6) for the a/ can be interpreted as a system of 10 
linear equations for the powers £ 2 ,..., £ 1(1 over C. These equations are indepen¬ 
dent, so the system can be solved. Indeed, using elementary properties of 10th roots 
of unity, it can be shown that 


In particular, 

( -ToH 

Thus we have expressed £i i in terms of radicals, using only square roots and fifth 
roots. 

Vandermonde’s answer also uses only square roots and fifth roots, and can be 
deduced from the above formula. Because he used a variant of the above strategy, 
his answer does not immediately look the same as ours, but it is equivalent. To go 
beyond Vandermonde, we must prove that his method works for all primitive nth 
roots of unity. This we now establish. 


21.4 The General Case 

The time has come to define what we mean by a ‘genuine’ radical expression. 
Recall from Definition 8.12 that the radical degree of the radical nj is n, and define 
the radical degree of a radical expression to be the maximum radical degree of the 
radicals that appear in it. 

Definition 21.1. A number a £ C has a genuine radical expression if a belongs to 
a radical extension of Q formed by successive adjunction of Ath roots of elements j3, 
where at every step the polynomial t k — j3 is irreducible over the field to which the 
root is adjoined. 

This definition rules out \/T as a genuine radical expression for £i i, but it permits 
\/ -1 as a genuine radical expression for i, and \/2 as a genuine radical expression 
for—well, n. 

Our aim is to prove a theorem that was effectively stated by Vandermonde, and 
proved in full rigour (and greater generality, but we have to stop somewhere) by 
Gauss. The name ‘Vandermonde-Gauss Theorem’ is not standard, but it ought to be, 
so we shall use it. 
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Theorem 21.2 (Vandermonde-Gauss Theorem). For any n > 1, any nth root of 
unity has a genuine radical expression. 

The aim of this section is to prove the Vandermonde-Gauss Theorem. In fact 
we prove something distinctly stronger: see Exercise 21.3. We prove the theorem by 
induction on n. It is easy to see that the induction step reduces to the case where n 
is prime and the nth root of unity concerned is therefore primitive, because if n is 
composite we can write it as n = pq where p is prime, and yr = 

Let n p he prime and focus attention on a primitive pth root of unity £ p , which 
for simplicity we denote by £. In trigonometric terms, 

„ 2n 2 k 

q = cos-b i sin — 

P P 


but we do not actually use this formula. 

We already know the minimal polynomial of £ over Q, from Lemma 3.22. It is 

m (t) = t p ~ l + t p - 1 + • • • +t + 1 = t —^~ 
w t-1 


Let 


0 = cos 


2n 

V 1 1 


2k 

+ i sin- 

P~ 1 


be a primitive (p — 1 )th root of unity. Since p — 1 is composite (except when p 
the minimal polynomial of 0 over Q is not equal to 


c{t)=t p - 2 +t p - 3 


t p - 1 - 1 
t- 1 


2,3) 


but instead it is some irreducible divisor of c(t). 

We work not with Q(£) : Q, but with : Q. Since p,p — 1 are coprime, 

this extension is the same as 

Q(0C):Q 

where is a primitive pip — l)th root of unity. A general element of Q (6£) can 
be written as a linear combination over Q(0) of the powers 1, £ 2 ,..., £ p ~ 2 . It 

is convenient to throw in £ p ~ l as well, but now we must always bear in mind the 
relation 1 + £ + £ 2 H-b £ p_1 = 0. 

We base the deduction on the following result, which we prove in Section 21.7 
to avoid technical distractions. 


Theorem 21.3. The Galois group o/Q(0£) : Q(0) is cyclic of order p—1. It com¬ 
prises the Q ( 0) -automorphisms of the form pj, (7 = 1,2,.../? — 1), where 

Pi: C -> C j 

040 
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The main technical issue in proving this theorem is that although we know that 
£, £ 2 ,..., 2 are linearly independent over Q, we do not (yet) know that they are 

linearly independent over Q(0). Even Gauss omitted the proof of this fact from his 
discussion in the Disquisition.es Arithmeticae, but that may have been because to him 
it was obvious. He never published a proof of this particular fact, though he must 
have known one. So in a sense the first complete proof should probably be credited 
to Galois. 

Assuming Theorem 21.3, we can follow Vandermonde’s method in complete gen¬ 
erality, using a few simple facts about roots of unity. 


Proof of the Vandermonde-Gauss Theorem. We prove the theorem by induction on 
n. The cases n = 1,2 are trivial since the roots of unity concerned are 1,-1. As 
explained above, the induction step reduces to the case where n is prime and the nth 
root of unity concerned is therefore primitive. Throughout the proof it helps to bear 
in mind the above examples when n = 5,11. 

We write n = p to remind us that n is prime. Let £ he a primitive pth root of 
unity and let 0 be a primitive (p — 1 )th root of unity as above. Then 0 £ is a primitive 
p{p — l)th root of unity. 

By Theorem 21.3, the Galois group of Q (GQ : Q is isomorphic to Z* and is 
thus cyclic of order p — 1 by Corollary 19.9. It comprises the automorphisms pj 
for j = \p 1. Since Z* is cyclic, there exists a generator a. That is, every 
j € Z* p can be expressed as a power j = a 1 of a. Then pj = p^, so p a generates 

r = r(Q( 0 £) :Q( 0 )). 

By Theorem 21.3 and Proposition 17.18, Q(0£) : Q(0) is normal and separable, 
so in particular the fixed field of T is Q(0) by Theorem 12.2(2). Since p„ generates 
r, any element of Q( 0 £) that is fixed by p„ must lie in Q( 0 ). 

We construct elements fixed by p„ as follows. Define 


a, = C + 0 / C“ + 0 2/ C a2 + --- + 0 ( P“ 2)/ C flP 2 

P~ 2 

= £ e jl c al 

7=0 


( 21 . 8 ) 


for 0 < / < p — 2. Then 

PM^OJ 1 ^ 1 =0~ l a, 

7=0 


Therefore 


Pairf = (0 l cci) p 1 = ( 9 P l a p 1 = 1 • a p 1 =aj 


P -1 - „P - 1 


so a'l 1 is fixed by p„, hence lies in 


. Say 


a p - l =p,&®{9) 


Therefore 


a,= p -ypi (0 < / < p — 2 ) 
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Recall (Exercise 21.5) the following property of roots of unity: 


i + e j + e 2 j + ... + e (p-2)j = }p 1 ^ j ~° 

[0 if — 2 


Therefore, from (21.8), 

C = ^-[oo + aH- \-a p - 2 ] 

= ^y[ '■ Vh+ p -Vh + • • • + P ~V^\ 

which expresses £ by radicals over Q(0). 

Now, 6 is a primitive {p — 1 )th root of unity, so by induction 0 is a radical expres¬ 
sion over Q of maximum radical degree < p — 2. Each pi is also a radical expression 
over Q of maximum radical degree < p — 2 , since Pi is a polynomial in d with ra¬ 
tional coefficients. (Actually we can say more: if p > 2 then /? — 1 is even, so the 
maximum radical degree is max(2, ( p — l)/2). Note that when p = 3 we require a 
square root, but (p— l)/2 = 1. See Exercise 21.3.) 

Substituting the rational expressions in (21.9) we see that £ is a radical expres¬ 
sion over Q of maximum radical degree < p — 1. (Again, this can be improved to 
max(2, (p— l)/2) for p > 2, see Exercise 21.3.) 

Therefore, in particular, (21.9) yields a genuine radical expression for £ accord¬ 
ing to the definition, and the Vandermonde-Gauss Theorem is proved. □ 


21.5 Cyclotomic Polynomials 

In order to fill in the technical gap we first need: 

Theorem 21.4. Any two primitive nth roots of unity in C have the same minimal 
polynomial over Q. 

We proved this in Lemma 20.10 when n is prime, but the composite case is more 
difficult. Before starting on the proof, some motivation will be useful. 

Consider the case n = 12. Let £ = e 77 '-' 6 be a primitive 12th root of unity. We can 
classify its powers according to their minimal power d such that (&) d = 1. That 
is, we consider when they are primitive t/th roots of unity. It is easy to see that in this 
case the primitive d/th roots of unity are: 


d = 1 

1 

d = 2 

C 6 (=-1) 

d = 3 

c 4 ,c 8 (= 

II 

"^3 

c 3 ,? 9 (= «,-«■) 

v© 

II 

“^3 

C 2 ,C 10 (= -CO,-CO 2 

d= 12 

c,c 5 ,c 7 ,c n 
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We can factorise f 12 — 1 by grouping corresponding zeros: 

f 12 — 1 = (t — l)x 
(t-C 6 )x 
(t-C 4 )(^C 8 )x 
(t-C 3 )(^C 9 )x 
(/-C 2 )(/-C 10 )x 

(/-C)(/-C 5 )(/-C 7 )(/-C n ) 

which simplifies to 

? 12 -l = ( ? -l)( ?+ l)( ? 2 + ? + l)( ? 2 + l)( ? 2 -? + l)F(f) 

where 

F(0 = (f-C)(?-C 5 )(?-O(f-C 11 ) 

whose explicit form is not immediately obvious. One way to work out F(t) is to use 
trigonometry (Exercise 21.4). The other is to divide t 12 — 1 by all the other factors, 
which leads rapidly to 

F(t)=t 4 -t 2 + 1 

If we let be the factor corresponding to primitive d/th roots of unity, we have 
proved that 

t^~ — 1 Oi < 4 > 2 < 4 > 3 < 4 > 4 < 4 > 6 < J ) ]2 

Our computations show that every factor <4> ; lies in Z\t], In fact, it turns out that the 
factors are all irreducible over Z. This is obvious for all factors except f 4 - 1 2 + 1, 
where it can be proved by considering the factorisation ( t — £)(/ — £ 5 )(t — £ 7 )(f ~ 
£*') (Exercise 21.5). 

This calculation generalises, as the following proof (eventually) shows. 

Proof of Theorem 21.4. Factorise t n — 1 into monic irreducible factors in Q[f], By 
Corollary 3.18 these actually lie in Z[t\. By the derivative test, t n —l has no multiple 
zeros. So each zero is a zero of exactly one of these factors, and that factor is its 
minimal polynomial. Hence two zeros of t n — 1 have the same minimal polynomial if 
and only if they are zeros of the same irreducible factor. Denote the factor of which 
an nth root of unity e is a zero by where the square brackets remind us that 

different e can be zeros of the same polynomial. 

We claim that if p is any prime that does not divide n, then e and e p have the 
same minimal polynomial. This step, which is not at all obvious, is the heart of the 
proof. 

We prove the claim by contradiction. If it is false, then m^ e p ] (f) ^ ( t ). Define 

k(t) = JH[ £ p](/ p ) € Z[t] 


so 


k{e) = my eP i [ (e p ) = 0 
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Therefore m^ e ] (/) divides k(t) in Z[f], so there exists q(t ) € Z[t\ such that 

m [e] (t)q(t) = k(t) 

Reduce coefficients modulo p as in Section 3.5. Using bars to denote images modulo 

P’ 

m\ e ](t)q(t) = k(t) = m[ eP ](t p ) = (w[ gP] (f)) p 

since the Frobenius map is a monomorphism in characteristic p by Lemma 17.14. 
Therefore m[ £P ] ( t) and ( t) have a common zero in some extension field of Z p , so 
that 

=n»i[ £ ](o 

[e] 

has a repeated zero in some extension field of Z p . By Lemma 9.13 (generalised), 
t n — 1 and its formal derivative have a common zero. However, the formal derivative 
of t n — 1 is fit” -1 and h /= 0 since p\n. Now 

1=1 

n 

so no such common zero exists (that is, iit n 1 and t" — 1 are coprime). This contra¬ 
diction shows that e and e p have the same minimal polynomial. 

It follows that e and e" have the same minimal polynomial for every u = p\.. .pi, 
where the pj are primes not dividing n. These u are precisely the natural numbers that 
are prime to n, so modulo n they form the group of units Z*. However, the primitive 
nth roots of unity are precisely the elements e" for such u. □ 

Definition 21.5. The polynomial <t>^(f) defined by 

®n{t) = FI ( f -C a ) (2U0) 

a£Z n ,(a,n)=l 

is the nth cyclotomic polynomial over C. 

Corollary 21.6. For all n G N, the polynomial lies in Z [f] and is monic and 

irreducible. 


21.6 Galois Group of Q(Q : Q 

In Theorem 20.12 we described the Galois group of Q(£): Q when C is a primi¬ 
tive pth root of unity, p prime. We now generalise this result to the composite case. 

Let f(t) =t n — 1 S Q[f]. The zeros in C are 1, ..., t^ n ~ l where ^ = e 271 '/" 

is a primitive nth root of unity. The splitting field of / is clearly Q(C)- Theorem 9.9 
implies that the extension Q(£) : Q is normal. By Proposition 9.14 it is separable. 

We will need: 
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Definition 21.7. The group of units Z* of Z„ consists of the elements a £ Z„ such 
that 1 < a < n and a is prime to n, under the operation of multiplication. 

The order of this group is given by an important number-theoretic function: 

Definition 21.8. The Euler function (j>(n) is the number of integers a , with 1 < a < 
n — 1 , such that a is prime to n. 

Definition 21.8 implies immediately that the order of Z* is equal to (!) in). 

The Euler function (j>(n) has numerous interesting properties. In particular 

Hp k ) = (p-i)p k ~ l 


if p is prime, and 

H r )H s ) = H rs ) 

when r.s are coprime. See Exercise 12.4. 

We can now prove: 


Theorem 21.9. (1) The Galois group T(Q(£): Q) consists of the Q -automorphisms 

ytj defined by 


Vj(0 = S J 


where 0 < j < n — 1 and j is prime to n. 


(2) r(Q(C) : Q) is isomorphic to Z* and in particular is an abelian group. 

(3) Its order is <j)(n). 


(4) If n is prime, Z* is cyclic. 

Proof. (1) Let y G T(Q(£) : Q). Since y(Q is a zero of t" — 1, y — \ y/j for some j. 

If j and n have a common factor d > 1 then y/j is not onto and hence not a 
Q-automorphism. 

If j and n are coprime, there exist integers a , b such that aj + bn = 1. Then 


£ _ ^aj+bn _ j*aj£ 


bn 


(c j r 


so £ lies in the image of y/j. It follows that t f/j is a Q-automorphism. 

(2) Clearly yt/y/k = Yjk^ so the map y/j i—> j is an isomorphism from T(Q(£): Q) 
to Z*. 

(3) |E(Q(C): Q)| = |Z*| = 0(»). 

(4) This follows from Corollary 19.9. □ 


21.7 The Technical Lemma 

We can now fill in the technical gap in the proof of the Vandermonde-Gauss 
Theorem in Section 21.4. 
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Theorem 21.10. Let K be the splitting field of over Q. Then the Galois group 

of the extension K : Q is isomorphic to the group of units Z* of the ring Z„. 

Proof The zeros of <1>„(/) in C are powers (f l of a primitive nth root of unity 
where a ranges through the integers modulo n that are prime to n. The result is then 
a direct consequence of Theorem 21.9. □ 

We can now give the 

Proof of Theorem 21.3. Since Q(£) : Q is normal, every automorphism of Q(0£) 
over Q(0) carries Q(£) to itself. Therefore restriction of automorphisms gives a 
homomorphism 

IT. r(Q( 0 C):Q( 0 ))->r(Q(C):Q) 

Now T(Q(C) : Q) is cyclic of order p — 1, so it suffices to prove that y/ is an iso¬ 
morphism. Since Q(0£) = Q(0)(£), every automorphism of this field over Q(0) 
is determined by its effect on £. Therefore distinct automorphisms induce distinct 
automorphisms of Q(£), showing that y/ is one-to-one. 

To show it is onto, it suffices to prove that T(Q(0£) : Q(0)) and T(Q(£) : Q) 
have the same order. 

Denote a primitive nth root of unity by By Theorem 21.10, for every n the 
order of T(Q(£„) : Q) = Z* = (j>(n). The tower law implies that if 0 < r,s € N then 

But when r,s are coprime, (j>(rs) = 0(r)0(s), so 0(rs)/0(j) = <j>(r) = |r(Q(£ r ): Q)|. 
Set r = p,s = p— lto get what we require. □ 


21.8 More on Cyclotomic Polynomials 

It seems a shame to stop without saying a little more about the cyclotomic poly¬ 
nomials, because they are fascinating. 

Theorem 21.10 shows that the cyclotomic polynomial < t > „(/) is intimately asso¬ 
ciated with the ring Z„ and its group of units Z*, which we discussed briefly in 
Chapter 3. In particular, the order of this group is 

\K\ = <H") 

where (j) is the Euler function. Definition 21.8, so ®(n) is the number of integers a , 
with 1 < a < n — 1, such that a is prime to n. 

The most basic property of the cyclotomic polynomials is the identity 


d\n 


( 21 . 11 ) 
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which is a direct consequence of their definition. We can use this identity recursively 
to compute Thus 

4>i(t)=/-l 


SO 


which implies that 


Similarly 


and 


. t 2 - 1 t 2 - 1 
2W $i(f) f-1 

f 3 - 1 

= —-=t 2 +t+l 


<*>4(0 = 


f 4 - 1 


(t - l)(t + 1) 


= r +1 


and so on. Table 21.8 shows the first 15 cyclotomic polynomials, computed in this 
manner. A curiosity of the table is that the coefficients of <t>„ always seem to be 0,1, 
or — 1. Is this always true? See Exercise 21.11. 


n ®„(t) 


1 t -1 

2 t +1 

3 t 2 +t +1 

4 f 2 + l 

5 t 4 +t 3 +t 2 +t +1 

6 f 2 — t + 1 

7 f 6 +f 5 +f 4 +f 3 +f 2 +t + l 

8 f 4 +l 

9 f 6 +f 3 + l 

10 t 4 -t 3 +t 2 -t +1 

11 f 10 +f 9 + f 8 +f 7 +f 6 +f 5 + f 4 +f 3 +f 2 + f + 1 

12 f 4 -f 2 + l 

13 f 12 +f u +f 10 + f 9 + f 8 +f 7 + f 6 +f 5 +f 4 +f 3 +f 2 +f+ 1 

14 t 6 -t 5 +t 4 -t 2 +t 2 -t + 1 

15 f 8 -f 7 +f 5 -f 4 +f 3 -f + l 
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21.9 Constructions Using a Trisector 

For a final flourish, we apply our results to the construction of regular polygons 
when an angle-trisector is permitted, as well as the traditional ruler and compass. 
The results are instructive, amusing, and slightly surprising. For example, the regular 
7-gon can now be constructed. It is not immediately clear why the angle =j- arises 
from trisections. Other regular polygons, such as the 13-gon and 19-gon, also become 
constructible. On the other hand, the regular 11-gon still cannot be constructed. 

The main point is the link between trisection and irreducible cubic equations. The 
trigonometric solution of cubics. Exercise 1.8, shows that an angle-trisector can be 
used to solve some cubic equations: those in the ‘irreducible case’, with three distinct 
real roots. Specifically, we use the trigonometric identity cos 30 = 4cos 3 0 — 3 cos 0 
to solve the cubic equation t 3 +pt + q = 0 when 27 q 2 +4 p 3 < 0. This is the condition 
for three distinct real roots. The method is as follows. 

The inequality 27 pq 2 +4 /; 3 < 0 implies that p < 0, so we can find a,b such that 
p = —3a 2 ,q = —a 2 b. The cubic becomes 

t 3 — 3 a 2 t = a 2 b 

and the inequality becomes a > 1 * 1 / 2 . 

Substitute t = 2a cos 0, and observe that 

t 3 — 3 a 2 t = 8 a 3 cos 3 0 —6a 3 cos 0 = 2 a 3 cos 3 0 


The cubic thus reduces to 


which we can solve using cos 


b 

cos 30 = — 

2 a 

because * | < 1 , getting 


0 = 


cos 


b 
2 a 


There are three possible values of 0, the other two being obtained by adding ^ or 
^. Finally, eliminate 0 to get 


t = 2a cos 



b 
2 a 


where a = \J~^, b = y. 

Conversely, solving cubics with real coefficients and three distinct real roots lets 
us trisect angles. So when a trisector is made available, the constructible numbers 
now lie in a series of extensions, starting with Q, such that each successive extension 
has degree 2 or 3. 

The use of a trisector motivates a generalisation of Fermat primes, named after 
the mathematician James Pierpont. 
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Definition 21.11. A Pierpont prime is a prime p of the form 

p = 2 a 3 b + 1 

where a > \,b>0. 

(Here we exclude a = 0 because in this case 2 a 3* + 1 = 3* + 1 is even.) 

The Pierpont primes up to 100 are 3, 5, 7, 13, 17, 19, 37, 73, and 97. So they 
appear to be more common than Fermat primes, a point to which we return later. 

Andrew Gleason (1988) proved the following theorem characterising those reg¬ 
ular n-gons that can be constructed when the traditional instruments of Euclid are 
supplemented by an angle-trisector. He also gave explicit constructions of that kind 
for the regular 7-gon and 13-gon. 

Theorem 21.12. The regular n-gon can be constructed using ruler, compass, and 
trisector, if and only if n is of the form 2''3 s p\ ■ ■ ■ pk where r,s > 0 and the pj are 
distinct Pierpont primes > 3. 

Proof First, suppose that the regular n-gon can be constructed using ruler, compass, 
and trisector. As remarked above, this implies that the primitive nth root of unity 
£ = e, 2m / n lies in the largest field in some series of extensions, which starts with Q, 
such that each successive extension has degree 2 or 3. Therefore 

[Q(C):Q]=2 c 3 d 

for c,d € N. 

The degree [Q(£) : Q] equals <j>(n ), where <j> is the Euler function. This is the 
degree of the cyclotomic polynomial <t>„(t), which is irreducible over Q. Therefore a 
necessary condition for constructibility with ruler, compass, and trisector is © (n) = 
2 a 3 b for a, b € N. What does this imply about n? 

Write n as a product of distinct prime powers p ■ 1 . Then (/)(p - 1 ) must be of the 

form 2 a >3 b ’. Since 0 (/?'") = {p— 1 )p m ~ l when p is prime, we require (pj -1 p"’’ * to 
be of the form 2 a i3 b K 

Either mj = 1 or pj = 2,3. If pj = 2 then (/){pj j ) = 2"W 1 and any mj can occur. 
If pj = 3 then © ip'd ) = 2 • 3"'' 1 and again any power of 3 can occur. Otherwise 

mj = 1 so 0(p” ,; ) = tj>(pj) = Pj — 1, and pj = 2 a i3 b i + 1. Thus pj is a Pierpont 
prime. 

We have now proved the theorem in one direction: in order for the regular n-gon 
to be constructible by ruler, compass, and trisector, n must be a product of powers of 
2, powers of 3, and distinct Pierpont primes > 3. 

We claim that the converse is also true. 

The proof is a simple application of Galois theory. Let p = 2“3 h + 1 be an odd 
prime. Let £ = e 2m /P. Then [Q(£) : Q] = p — 1 = 2 a 3 b . The extension Q(£) : Q 
is normal and separable, so the Galois correspondence is a bijection, and the Ga¬ 
lois group r = r(Q(C) : Q) has order m = 2 a 3 b . By Theorem 21.10 it is abelian, 
isomorphic to h*„. Therefore it has a series of normal subgroups 


i =r 0 <r 1 <---<r r = r 
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where each factor r 7+ i /Tj is isomorphic either to Z2 or Z3. In fact, r = a + b. 

Let 

0 = C + C 1 = C + C p_1 = C + C = 2cos2n/p 

where the bar indicates complex conjugate. Then 0eR. Consider the tower of sub¬ 
fields 

Qcq(6)cq(C) 

Clearly Q(0) C M. We have £ + = 0, £ ■ £ _1 = 1, so £ and £ _1 are the zeros of 

f 2 — Qt + 1 over Q(0). Therefore [Q(£) : Q(0)] < 2, but £ ^ R. D Q(0) so [Q(£) : 

Q(0)]=2. 

The group A of order 2 generated by complex conjugation is a subgroup of T, 
and it is a normal subgroup since T is abelian. We claim that the fixed field A' = 
Q(0) = Q(C) HR. We have Q(£) C R so Q(C) C A 2 . Since [Q(£): Q(0)] = 2 the 
only subfield properly containing Q(£) is Q(0), and this is not fixed by A. Therefore 
Q(£) = A’*'. (It is easy to see that in fact, Q(0) = Q(^) ClR.) 

Therefore the Galois group of Q(0) : Q is isomorphic to the quotient group A = 
r /A, so it is cyclic of order in/ 2 = 2 a l 3 ,; . It has a series of normal subgroups 

1 = Ao <3 Ar o - • • o A,_ 1 = A 

where each factor A ,- + i /A ; is isomorphic either to Z2 or Z3. 

The corresponding fixed subfields Kj = A 'j form a tower 


Q{d)=K 0 DK 1 D---DK r _i=Q 


and each degree [Kj : Kj + 1 ] is either 2 or 3. So Kj can be obtained from Kj + 1 by 
adjoining either: 

a root of a quadratic over K J+ \ , or 

a root of an irreducible cubic over Kj + 1 with all three roots real 
(the latter because Q(0) C R). 

In the quadratic case, any z € Kj can be constructed from K r \ by ruler and 
compass. In the cubic case, any z £ Kj can be constructed from Kj + \ by trisector (plus 
ruler and compass for field operations). By backwards induction from K, | = Q, 
we see that any element of Kq can be constructed from Q by ruler, compass, and 
trisector. Finally, any element of Q(£) can be constructed from Q by ruler, compass, 
and trisector. In particular, £ can be so constructed, which gives a construction for a 
regular p- gon. □ 

This is a remarkable result, since at first sight there is no obvious link between 
regular polygons with (say) 7, 13, or 19 sides and angle-trisection. They appear to 
need division of an angle by 7, 13, or 19. So we give further detail for the first two 
cases, the 7-gon and the 13-gon. 

p = 7: Let C = e 2jr ' /7 . Recall the basic relation 

1 + C + C 2 + C 3 + C 4 + C 5 + C 6 = 0 


(21.12) 
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Define 

OjT 

n = C + C 6 = 2cosy eK 
r 2 = C 2 + C 5 = 2 cos+^ € R 
T 3 = £ 3 + £ 4 = 2 cos y gR 


Compute the elementary symmetric functions of the rj. By (21.12) 

n + n + r3 = -1 


Next, 

nr 2 r 3 = (C + C 6 )(C 2 + C 5 )(C 3 + C 4 ) 

= C 6 + C° + C 2 + C 3 + C 4 + C 5 + C° + C 

= 1 + 1-1 = l 


Finally, 

nr 2 +nr 3 +r 2 r 3 = (£ + C 6 )(C 2 + C 5 ) + (C+ C 6 )(C 3 + C 4 ) + (C 2 + C 5 )(C 3 + C 4 ) 

= C 3 + C 6 + C + C 4 + C 4 + C 5 + C 2 + C 3 + C 5 + C 6 + C + C 2 
= -2 


Therefore the rj are roots of the cubic t 3 + 2f 2 + f — 1 = 0. This is irreducible (exer¬ 
cise) and the roots rj are real. So they can be constructed using a trisector (plus ruler 
and compass for field operations). We omit details; an explicit construction can be 
found in Gleason (1988) and Conway and Guy (1996) page 200. 
p = 13; Let £ = e 27C ' /l:i . Recall the basic relation 


i + C + C 2 + --- + C 12 = o 


(21.13) 


Define rj = = 2cos ^ for 1 < j < 6 . 

It turns out that 2 is primitive root modulo 13. That is, the powers of 2 (mod 13) 
are, in order. 


1 2 4 8 3 6 12 11 9 5 10 7 


and then repeat: these are all the nonzero elements of Z 13 . 

Add powers of £ corresponding to every third number in this sequence: 

5! = C + C 8 + C 12 + C 5 =n+r 5 
s 2 = C 2 + C 3 + C U + C 10 = r 2 + G 
*3 = C 4 + C 6 + C 9 + C ? = '-4 + r 6 


Tedious but routine calculations show that the sj are the three roots of the cubic 


f 3 + f 2 — 4f + 1 =0 
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which is irreducible (exercise) and has all roots real. Therefore the Sj can be con¬ 
structed using trisector, ruler, and compass. 

Then, for example. 


n +r 5 = s 1 

rv 5 = (C + C 12 )(C 5 + C 8 ) 

= C 6 + C 9 + C 4 + C 7 = S3 

so r \, r 5 are roots of a quadratic over Q(«i, « 2 , S 3 ). The same goes for the other pairs 
of rj. Therefore we can construct the r ; by ruler and compass from the Sj. Finally, 
we can construct £ from the r ; by solving a quadratic, hence by ruler and compass. 

An explicit construction can again be found in Gleason (1988) and Conway and 
Guy (1996) page 200. 

Earlier, I said that the Pierpont primes p = 2 a 3 b + 1 form a much richer set 
than the Fermat primes. It is worth expanding on that statement. It is generally be¬ 
lieved that the only Fermat primes are the known ones, 2, 3, 5, 17, 257, and 65537, 
though this has not been proved. In contrast, Gleason (1988) conjectured that Pier¬ 
pont primes are so common that there should be infinitely many ; he suggested that 
there should be about 9k of them less than 10 A . More formally, the number of Pierpont 
primes less than N should be asymptotic to a constant times log A'. This conjecture 
remains open, but with modern computer algebra it is easy to explore larger values. 
For example, a quick, unsystematic search turned up the Pierpont prime 

2 148 3 95 + 1 = 756 760 676 272 923 020 551 154 471 073 
240 459 834 492 063 891 235 892 290 277 
703 256 956 240 171 581 788 957 704 193 

with 90 digits. There are 789 Pierpont primes up to 10 10 °. Currently, the largest 
known Pierpont prime is 3 x 2 7033641 + 1, proved prime by Michael Herder in 2011. 


EXERCISES 

21.1 Prove that, in the notation of Section 21.4, 


21.2 Prove that <£24 (f) = t s — t 4 + 1. 

21.3 Show that the zeros of the d/th cyclotomic polynomial can be expressed by 
radicals of degree at most max(2 ,(d— l)/2). (The 2 occurs because of the 
case d = 3.) 
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21.4 Use trigonometric identities to prove directly from the definition that <t> 12 (f) = 
f 4 -f 2 + 1. 

21.5 Prove that < 4 * 12(0 is irreducible over Q. 

21.6 Prove that if 9 is a primitive (p — l)th root of unity, then 

1 + e J + e 2j +--- + e (p - 2 > J = ( J ~~ 1 j -Z 0 

(0 if J < l < p — 2 

21.7 Prove that the coefficients of <t > p (t) are all contained in {—1,0,1} when p is 
prime. 

21.8 Prove that the coefficients of <4^/ (0 are all contained in { — 1,0,1} when p is 
prime and k > 1 . 

21.9 If m is odd, prove that < 4 > 2 m (f) = <4> m (—f), and deduce that the coefficients of 
<t> lp k(t) are contained in {— 1 , 0 , 1 } when p is an odd prime and k > 1 . 

21.10 If p,q are distinct odd primes, find a formula for <4> p 9 (f) and deduce that the 
coefficients of <4 > pq (t) are all contained in {— 1 , 0 , 1 }. 

21.11 Relate <4 ' pa (t) and <4>„A a (f) when a.p are odd, p is prime, p and a are co¬ 
prime, and k > 1. Deduce that if the coefficients of <4 > ;M (f) are all contained in 
{— 1 , 0 , 1 }, so are those of <4 > pka (t). 

21.12 Show that the smallest n such that the coefficients of <4> m (t) might not all be 
contained in {—1,0,1} is n = 105. If you have access to symbolic algebra 
software, or have an evening to spare, lots of paper, and are willing to be very 
careful checking your arithmetic, compute < 4 >io 5 (f) and see if some coefficient 
is not contained in {— 1 , 0 , 1 }. 

21.13 Let 0 {n) be the Euler function. Prove that 

Hp k ) = {p- 1)/” 1 


if p is prime, and 

0W0W =<t>(rs) 

when r,s are coprime. Deduce a formula for <j>(n) in terms of the prime fac¬ 
torisation of n. 

12.14 Prove that 

<l>(n) = n 11 

p prime, p\n ^ ' 

12.15 If a is prime to n, where both are integers, prove that = 0 (mod n). 

12.16 Prove that for any m € N the equation <t) (n) = m has only finitely many solu¬ 
tions n. Find examples to show that there may be more than one solution. 
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12.17 Experiment, make an educated guess, and prove a formula for Y.d\n 0(</). 

12.18 If n is odd, prove that (j)(4n) = 2(f) (n). 

12.19 Check that 

1+2 = ^( 3 ) 

1+3 = ^(4) 

1 +2 + 3+4 = ^(5) 

1+5 = ^(6) 

1+2 + 3+4 + 5 + 6 = ^0(7) 

What is the theorem? Prove it. 

12.20* Prove that if g £ Z, 4 then g 2 = 1, so g has order 2 or is the identity. Show 
that 24 is the largest value of n for which every non-identity element of Z* has 
order 2. Which are the others? 

21.21 Outline how to construct a regular 19-gon using ruler, compass, and trisector, 
along the lines discussed for the 7-gon and 13-gon. 

21.22 Extend the list of Pierpont primes up to 1000. 

21.23 If you have access to a computer algebra package, use it to extend the list of 
Pierpont primes up to 1,000,000. 

21.24 (1) Prove that 2"3 /; + 1 is composite if a and b have an odd common factor 

greater than 1. 

(2) Prove that 2 a 3 b + 1 is divisible by 5 if and only if a — b = 2 (mod 4). 

(3) Prove that 2 a 3 b + 1 is divisible by 7 if and only if a + 2b = 0 (mod 3). 

(4) Find similar necessary and sufficient conditions for 2 a 3 b + 1 to be divis¬ 
ible by 11, 13, 17, 19. 

(5) Prove that 2 a 3 b + 1 is never divisible by 23. 

[Hint: For (2, 3, 4, 5) prove that if p is prime then 2 a 3* + 1=0 (mod p) if and 
only if 2 a + 3 “ b = 0 (mod p), and look at powers of 2 and 3 modulo p.] 

21.25 Mark the following true or false. 

(a) Every root of unity in C has a expression by genuine radicals. 

(b) A primitive 11th root of unity in C can be expressed in terms of rational 
numbers using only square roots and fifth roots. 
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(c) Any two primitive roots of unity in C have the same minimal polynomial 
over Q. 

fd) The Galois group of over Q is cyclic for all n. 

(e) The Galois group of <f>„(V) over Q is abelian for all n. 

(f) The coefficients of any cyclotomic polynomial are all equal to 0, ±1. 

(g) The regular 483729409-gon can be constructed using ruler, compass, and 
trisector. (Hint: This number is prime, and you may assume this without 
further calculation.) 



Chapter 22 

Calculating Galois Groups 


In order to apply Galois theory to specific polynomials, it is necessary to compute 
the corresponding Galois group. This was the weak point in the memoir that Galois 
submitted to the French Academy of Sciences, as Poisson and Lacroix pointed out in 
their referees’ report. 

However, the computation is possible—at least in principle. It becomes practical 
only with modern computers. It is neither simple nor straightforward, and until now 
we have emulated Galois and strenuously avoided it. Instead we have either studied 
special equations whose Galois group is relatively easy to find (I did say ‘relatively’), 
resorted to special tricks, or obtained results that require only partial knowledge of 
the Galois group. The time has now come to face up squarely to the problem. This 
chapter contains relatively complete discussions for cubic and quartic polynomials. It 
also provides a general algorithm for equations of any degree, which is of theoretical 
importance but is too cumbersome to use in practice. More practical methods do 
exist, but they go beyond the scope of this book, see Soicher and McKay (1985) and 
the two references for Hulpke (Internet). The packages Maple and GAP can compute 
Galois groups for relatively small degrees. 


22.1 Transitive Subgroups 

We know that the Galois group T(f) of a polynomial / with no multiple zeros 
of degree n is (isomorphic to) a subgroup of the symmetric group §„. In classical 
terminology, F(/) permutes the roots of the equation f(t) = 0. Renumbering the 
roots changes T(f) to some conjugate subgroup of §„, so we need consider only the 
conjugacy classes of subgroups. However, S n has rather a lot of conjugacy classes 
of subgroups, even for moderate n (say n > 6). So the list of cases rapidly becomes 
unmanageable. 

However, iff is irreducible (which we may always assume when solving f(t) = 
0) we can place a fairly stringent restriction on the subgroups that can occur. To state 
it we need: 

Definition 22.1. Let G be a permutation group; that is, a subgroup of the group of 
all permutations on a set S. We say that G is transitive (or transitive on S ) if for all 
s,t £ S there exists y £ G such that y(s) = t. 
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To prove G transitive it is enough to show that for some fixed so £ S, and any 
s £ S, there exists y £ G such that yf.s'o) = s. For if this holds, then given t £ S there 
also exists 8 £G such that 8(sq) = t, so (57 _1 )(s) = t. 

Examples 22.2. (1) The Klein four-group V is transitive on {1, 2, 3,4}. The element 
1 is mapped to: 

1 by the identity 

2 by (12)(34) 

3 by (13)(24) 

4 by (14)(23) 

(2) The cyclic group generated by a = (1234) is transitive on {1, 2, 3, 4}. In fact, a' 
maps 1 to i for i = 1,2, 3,4. 

(3) The cyclic group generated by p = (123) is not transitive on {1, 2, 3, 4}. There 
is no power of p that maps 1 to 4. 

Proposition 22.3. The Galois group of an irreducible polynomial f is transitive on 
the set of zeros of f. 

Proof If a and p are two zeros of / then they have the same minimal polynomial, 
namely /. By Theorem 17.4 and Proposition 11.4 there exists yin the Galois group 
such that y(a) =/3. □ 

Listing the (conjugacy classes of) transitive subgroups of §„ is not as formida¬ 
ble as listing all (conjugacy classes of) subgroups. The transitive subgroups, up to 
conjugacy, have been classified for low values of n by Conway, Hulpke, and MacKay 
(1998). The GAP data library 

http://www.gap-system.org/Datalib/trans.html 
contains all transitive subgroups of S n for n < 30. The methods used can be found 
in Hulpke (1996). There is only one such subgroup when n = 2, two when n = 3, 
and five when n = 4,5. The magnitude of the task becomes apparent when n = 6: 
in this case there are 16 transitive subgroups up to conjugacy. The number drops to 
seven when n =7; in general prime n lead to fewer conjugacy classes of transitive 
subgroups than composite n of similar size. 


22.2 Bare Hands on the Cubic 

As motivation, we begin with a cubic equation over Q, where the answer can be 
obtained by direct ‘bare hands’ methods. Consider a cubic polynomial 

f(t)=t 3 -sit 2 +s 2 t-s 3 €®[t] 
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The coefficient sj are the elementary symmetric polynomials in the zeros d\ , o 2 , 03 , 
as in Section 18.2. If / is reducible then the calculation of its Galois group is easy: it 
is the trivial group, which we denote by 1 , if all zeros are rational, and §2 otherwise. 
Thus we may assume that / is irreducible over Q. 

Let E be the splitting field of /, 

E = Q(ai, a 2 , a 3 ) 

By Proposition 22.3 the Galois group of / is a transitive subgroup of §3, hence is 
either S3 or A3. Suppose for argument’s sake that it is A3. What does this imply 
about the zeros a\, a 2 , O 3 I By the Galois correspondence, the fixed field A3 of A3 
is Q. Now A3 consists of the identity, and the two cyclic permutations (123) and 
(132). Any expression in a\ , a 2 . 0:3 that is invariant under cyclic permutations must 
therefore lie in Q. Two obvious expressions of this type are 

0 = o\a 2 + do 013 + af «i 


and 


y/ = 0 / 0:3 + OoOi + a 3 a 2 


Indeed it can, with a little effort, be shown that 


A 3 = Q(0,y/) 


(see Exercise 22.3). In other words, the Galois group of / is A 3 if and only if 0 and 
y/ are rational. 

This is useful only if we can calculate 0 and yr, which we now do. Because S3 
is generated by A3 together with the transposition (12), which interchanges <j) and 
yr, it follows that both 0 + yt and 0 yt are symmetric polynomials in 0 £i, a?, 0 : 3 . 
By Theorem 18.10 they are therefore polynomials in si,S 2 > an d S 3 . We can compute 
these polynomials explicitly, as follows. We have 

0 + V' r =E a i 2 °v 

¥j 

Compare this with 

S1S2 = {ai + a 2 + ai)(aia 2 + a 2 a 3 + a 3 ai) = E</ 2 «/ +3 ai 0:20:3 

¥J 


Since o.\ 0 . 10.3 = S 3 we deduce that 

<j> + yf = S 1 S 2 — 3s 3 


Similarly 

t py/ = Oia 2 a 3 + a 2 a3ai + a 3 aia 2 + afal + Ona^ + ajaf + 30^0^03 

= s 3 (a 1 3 + a| + a 3 ) + 3s§ + E o:, 3 a 3 

i<j 
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Now 


so that 


Moreover, 


Therefore 


«i = («i +a 2 + « 3 ) 3 

= (a, 3 + a| + a 3 ) + 3 Y °f a i + 6cq 0£2«3 
¥j 

a l + a 2 + a 3 = *1 — 6*3 — 3(*l*2 ^ 3*3) 

sl = (aia2 + a2«3 + a3ai) 3 

= Y a f a ] + 3 Y a f a - 2 j a k + 6af a? a 3 

i<j hj,k 

= Y a i a j + 3s i (E «?«/■) + 

<<y ' ' 

Y a f a j = ^2 3*3 (*i *2 3^3) 6*3 

i<j 

= s\ — 3i 1*2*3 + 3*3 


Putting all these together, 

< j ) y / = *3(* 3 —3*1*2+ 3*3)+*2 + 3*3 —3*l*2*3 +3*3 

= *1*3 + 9*3 — 6 * 1*2*3 +*? 


Hence <j> and t// are the roots of the quadratic equation 

t — at + b = 0 


where 


a = * 1*2 — 3*3 

b = *3(* 3 — 3*1*2+ 3*3 )+*2 + 3*3 — 3*i*2*3+3*3 

By the formula for quadratics, this equation has rational zeros if and only if 
sj a 2 — 4/7 G Q. Direct calculation shows that 

a 2 — 4b = * 1*2 + 18 * 1 * 2*3 — 27*3 — 4* 3 *3 — 4*2 

We denote this expression by A, because it turns out to be the discriminant of /. Thus 
we have proved: 

Proposition 22.4. Let f[t) = f 3 — s\t 2 +* 2 / -*3 £ Q[f] be irreducible over Q. Then 
its Galois group is A3 if 

A = * 1*2 + 18*1*2*3 — 27*3 — 4*1*3 — 4*2 

is a perfect square in Q, and is S3 otherwise. 
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Examples 22.5. (1) Let fit) = f 3 +3t + 1. This is irreducible, and 
si = 0 *2 = 3 S3 = — 1 

We find that A = —27 — 4.27 = —135, which is not a square. Hence the Galois group 

is S 3 . 

(2) Let f(t ) = f 3 — 3t — 1. This is irreducible, and 

si =0 S2 = —3 S3 = 1 

Now A = 81, which is a square. Hence the Galois group is A 3 . 


22.3 The Discriminant 

More elaborate versions of the above method can be used to treat quartics or 
quintics, but in this form the calculations are very unstructured. See Exercise 22.6 
for quartics. In this section we provide an interpretation of the expression A above, 
and show that a generalisation of it distinguishes between polynomials of degree n 
whose Galois groups are, or are not, contained in A„. 

The definition of the discriminant generalises to any field: 

Definition 22.6. Suppose that /(f) £ Kit) and let its zeros in a splitting field be 
a 1 _ ,a„. Let 

s=n 

i<j 

Then the discriminant A(/) of / is 

A if) = S 2 

Theorem 22.7. Let f £ K[t], where the characteristic of K is not 2. Then 

(1) A(/) £ K. 

(2) A if) = 0 if and only iff has a multiple zero. 

(3) If A(/) / 0 then A(/) is a perfect square in K if and only if the Galois group 
of f, interpreted as a group of permutations of the zeros of f, is contained in 
the alternating group A„. 

Proof. Let a £ S„, acting by permutations of the (Xj. It is easy to check that if a is 
applied to 5 then it changes it to ±5, the sign being + if a is an even permutation and 
- if cr is odd. (Indeed in many algebra texts the sign of a permutation is defined in this 
manner.) Therefore 8 £ A,). Further, A(/) = 8 2 is unchanged by any permutation 
in §„, hence lies in K. This proves (1). 
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Part (2) follows from the definition of A(/). 

Let G be the Galois group of /, considered as a subgroup of §„. If A(/) is a 
perfect square in K then 8 £ K, so 8 is fixed by G. Now odd permutations change 8 
to —5, and since char (A) / 2 we have 8 / — 8. Therefore all permutations in G are 
even, that is, C C A„. Conversely, if G C A„ then 8 £ G K. Therefore A(/) is a 
perfect square in K. □ 

In order to apply Theorem 22.7, we must calculate A(/) explicitly. Because it is 
a symmetric polynomial in the zeros Otj, it must be given by some polynomial in the 
elementary symmetric polynomials ,s> . Brute force calculations show that if / is a 
cubic polynomial then 

A (/) = + I 8 .s 1 .s 2 s 3 — 21 s\ — As\s^ — As\ 

which is precisely the expression A obtained in Proposition 22.4. Proposition 22.4 is 
thus a corollary of Theorem 22.7. 


22.4 General Algorithm for the Galois Group 

We now describe a method which, in principle, will compute the Galois group of 
any polynomial. The practical obstacles involved in carrying it out are considerable 
for equations of even modestly high degree, but it does have the virtue of showing that 
the problem possesses an algorithmic solution. More efficient algorithms have been 
invented, but to describe them would take us too far afield: see previous references 
in this chapter. 

Suppose that 

f(t) =t n -s l t n ~ 1 + ••• + (- 1 )% 

is a monic irreducible polynomial over a field K , having distinct zeros oq,..., a„ 
in a splitting field E. That is, we assume / is separable. The are the elementary 
symmetric polynomials in the oq. The idea is to consider not just how an element 
y of the Galois group G of / acts on oq,..., a n , but how y acts on arbitrary ‘linear 
combinations’ 

p — a'i oq H— .V/; a„ 

To make this action computable we form polynomials having zeros y( /J) as y runs 
through G. To do so, let x\,...,x„ be independent indeterminates, let p be defined as 
above, and for every a £ S„ define 

Gx(fi) = T ''' T^(r(n)®n 

Ga(P) =xi0£ ct (i)4- 1 -x n a a ^ 

By rearranging terms, we see that < 7 a (j 3 ) = 

(The notation here reminds us that <7 V acts on the xj, whereas o a acts on the oq .) 
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< 2 = n (*-*&))= n ('-*«(/») 

aeS„ treS„ 

If we use the second expression for Q, expand in powers of t, collect like terms, and 
write all symmetric polynomials in the dj as polynomials in the we find that 

Q= E ■■■ X 'r V 

j=0 V i / 

where the gj are explicitly computable functions of ,v |...., s n . In particular 
Q £ K[t,x i,... ,x„], (In the second sum above, i ranges over all u-tuples of nonnega¬ 
tive integers ( 4 ,..., i n ) with i\ -\ -f i„ + j = n) 

Next we split Q into a product of irreducibles, 

Q = Q\ ■ ■ -Qk 

in K[t,x 1 ,... ,x n ]. In the ring E[f ,xi. ,x„] we can write 

qj= n 

oeSj 

where S„ is the disjoint union of the subsets Sj. We choose the labels so that the 
identity of S n is contained in ,Sj , and then t — [5 divides Q\ in Y.\t,x 1 ,... ,x n ]. 

If a £ S„ then 

Q = <J x Q=(o x Qi)--(o x Q k ) 

Hence <J X permutes the irreducible factors Q t of Q. Define 

G = {t7eS„ : a x Q x =Qx} 

a subgroup of §„. Then we have the following characterisation of the Galois group 

off: 

Theorem 22.8. The Galois group G offis isomorphic to the group G. 

Proof The subset .Sj of §„ is in fact equal to G, because 

.Sj = {a :t — o x fi divides Q\ in T\t,x 1 ,... ,x„}} 

= {ct: t — p divides Of 1 Q\ in E[f,xi,... ,x„]} 

= {a : of x Qx = 0 i} 

= G 

Define 

H=n(t-o a m = n(t-o x m 

oeG aeG 

Clearly H £ K[t,x \,... ,x n \. Now H divides Q in E[t,xi,... ,x n ] so H divides Q 
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in Z(xi,... ,x n )[t]. Therefore H divides Q in K (xi,... -x n )[t] so that H divides Q in 
K[t,x i,... ,x n \ by the analogue of Gauss’s Lemma for K{x\,... ,x n ) [f], which can be 
proved in a similar manner to Lemma 3.17. 

Thus H is a product of some of the irreducible factors Qj of Q. Because y — 
p divides H we know that Q\ is one of these factors. Therefore Q\ divides H in 
K[t,x i,... ,x n ] so G C G. 

Conversely, let yG G and apply the automorphism y to the relation (t — P)\Qi- 
Since Q\ has coefficients in K , we get (t — Ya(P))\Q\ ■ Now t — Ya(P) = f — y/ 1 ] P) = 
y~ l (t - P), so Yx l {t ~ j3)|2i. Equivalently, (f — j3)|y v (2i). But Q\ is the unique 
irreducible factor of Q that is divisible by t — j3, so Yx(Q l) = Qu so y € G. 

□ 

Example 22.9. Suppose that a, ft are the zeros of a quadratic polynomial t 2 —At + 
B = 0, where A = a+ P and B = a/3. The polynomial Q takes the form 

Q = (t — ax — Py) (t — ay — j3x) 

= t 2 -t(ax + py + ay + px) + [(a 2 + p 2 )xy + ap (x 2 +y 2 )\ 

= t 2 -t(Ax + Ay) + [(A 2 -2B)xy + B(x 2 +y 2 )] 

This is either irreducible or has two linear factors. The condition for irreducibility is 
that 

A 2 (x + y) 2 -4[(A 2 -2B)xy + B(x 2 +y 2 )} 

is not a perfect square. But this is equal to 

(A 2 — AB)(x — y) 2 

which is a perfect square if and only if A 2 — 4B is a perfect square. Thus the Galois 
group G is trivial if A 2 — AB is a perfect square, and is cyclic of order 2 if A 2 — 4 B is 
not a perfect square. 

It is of course much simpler to prove this directly, but the calculation illustrates 
how the theorem works. 


EXERCISES 

22.1 Let / £ K[t] where char (K) / 2. If A(/) is not a perfect square in K and G 
is the Galois group of /, show that G\ TA„ has fixed field 

22.2* Find an expression for the discriminant of a quartic polynomial. (Hint: You 
may assume without proof that this is the same as the discriminant of its resol¬ 
vent cubic.] 

22.3 In the notation of Proposition 22.4, show that = Q(0, t//). 
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22.4 Show that 8 or —5 in Definition 22.6 is given by the Vandermonde determinant 
(see Exercise 2.5) 

1 1 ... 1 

a, a 2 ... a n 

a\ a\ ... a,“ 

a " -1 a " -1 ... a ,"" 1 

Multiply this matrix by its transpose and take the determinant to show that 
A(/) is equal to 

Ao Ai ... A w _\ 

At ... A„ 

A/; | A„ ... A 2 /7 — 2 

where At = af 4 -b a*. Hence, using Exercise 18.17, compute A(/) when / 

is of degree 2, 3, or 4. Check your result is the same as that obtained previously. 

22.5* If f(t ) = t" + at + b, show that 

A(/) = - l)" -1 a" 

where /i„ is 1 if n is a multiple of 4 and is — 1 otherwise. 

22.6* Show that any transitive subgroup of S 4 is conjugate to one of 84 ^ 4 , 04 , V, or 
Z 4 , defined as follows: 

A 4 = alternating group of degree 4 
V = {1, (12)(34), (13)(24), (14)(23)} 

O 4 = group generated by V and (12) 

Z 4 = group generated by (1234) 

22.7* Let / be a monic irreducible quartic polynomial over a field K of characteris¬ 
tic 7 ^ 2, 3 with discriminant A. Let g be its resolvent cubic, defined by the same 
formula that we derived for the general quartic, and let M be a splitting field 
for g. Show that: 

(a) r(/) = S 4 if and only if A is not a square in K and g is irreducible over 
K. 

(b) r(/) = A 4 if and only if A is a square in K and g is irreducible over K. 

(c) r(/) = O 4 if and only if A is not a square in K , g is reducible over K , and 
/ is irreducible over M. 
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(d) r(/) = Y if and only if A is a square in K and g is reducible over K. 

(e) r(/) = Z 4 if and only if A is not a square in K,g is reducible over K , and 
/ is reducible over M. 

22.8 Prove that {(123), (456), (14)} generates a transitive subgroup of § 6 . 

22.9 Mark the following true or false. 

(a) Every nontrivial normal subgroup of S„ is transitive. 

(b) Every nontrivial subgroup of S„ is transitive. 

(c) Every transitive subgroup of S„ is normal. 

(d) Every transitive subgroup of S„ has order divisible by n. 

(e) The Galois group of any irreducible cubic polynomial over a field of 
characteristic zero is isomorphic either to S 3 or to A 3 . 

(f) If A is a field of characteristic zero in which every element is a perfect 
square, then the Galois group of any irreducible cubic polynomial over 
K is isomorphic to A 3 . 



Chapter 23 

Algebraically Closed Fields 


Back to square one. 

In Chapter 2 we proved the Fundamental Theorem of Algebra, Theorem 2.4, 
using some basic point-set topology and simple estimates. It is also possible to give 
an ‘almost’ algebraic proof, in which the only extraneous information required is 
that every polynomial of odd degree over R has a real zero. This follows immediately 
from the continuity of polynomials over R and the fact that an odd degree polynomial 
changes sign somewhere between — °° and +°°. 

We now present this almost-algebraic proof, which applies to a slight general¬ 
isation. The main property of R that we require is that R is an ordered field, with 
a relation < that satisfies the usual properties. So we start by defining an ordered 
field. Then we develop some group theory, a far-reaching generalisation of Cauchy’s 
Theorem due to the Norwegian mathematician Ludwig Sylow, about the existence 
of certain subgroups of prime power order in any finite group. Finally, we combine 
Sylow’s Theorem with the Galois correspondence to prove the main theorem, which 
we set in the general context of an ‘algebraically closed’ field. 


23.1 Ordered Fields and Their Extensions 

As remarked in Chapter 2, the first proof of the Fundamental Theorem of Algebra 
was given by Gauss in his doctoral dissertation of 1799. His title (in Latin) was A 
New Proof that Every Rational Integral Function of One Variable can be Resolved 
into Real Factors of the First or Second Degree. Gauss was being polite in using the 
word ‘new’, because his was the first genuine proof. Even his proof, from the modern 
viewpoint, has gaps; but these are topological in nature and not hard to fill. In Gauss’s 
day they were not considered to be gaps at all. Gauss came up with several different 
proofs of the Fundamental Theorem of Algebra; among them is a topological proof 
that can be found in Hardy (1960 page 492). 

As discussed in Chapter 2, many other proofs are now known. Several of them 
use complex analysis. The one in Titchmarsh (1960 page 118) is probably the proof 
most commonly encountered in an undergraduate course. 

Less well known is a proof by Clifford (1968 page 20) which is almost entirely 
algebraic. His idea is to show that any irreducible polynomial over R is of degree 1 
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or 2. The proof we give here is essentially due to Legendre, but his original proof had 
gaps which we fill using Galois theory. 

It is unreasonable to ask for a purely algebraic proof of the theorem, since the real 
numbers (and hence the complex numbers) are defined in terms of analytic concepts 
such as Cauchy sequences. Dedekind cuts, or completeness in an ordering. 

We begin by abstracting some properties of the reals. 

Definition 23.1. An ordered field is a field K with a relation < such that: 

(1) k < k for all k £ K. 

(2) k < l and / < m implies k < m for all k, l,m £ K. 

(3) k < l and / < k implies k = l for all k,l £ K. 

(4) If k, l £ K then either k < l or / < k. 

(5) If k, l , m £ K and k < / then k + m < l + m. 

( 6 ) If k, l , m £ K and k < l and 0 < in then km < Im. 

The relation < is an ordering on K. The associated relations <, >, > are defined in 
terms of < in the obvious way, as are the concepts ‘positive’ and ‘negative’. 

Examples of ordered fields are Q and R. We need two simple consequences of 
the definition of an ordered field. 

Lemma 23.2. Let K be an ordered field. Then for any k £ K we have k 2 > 0. Further, 
the characteristic of K is zero. 

Proof. If k > 0 then k 2 > 0 by ( 6 ). So by (3) and (4) we may assume k < 0. If now 
we had —k < 0 it would follow that 

0 = k + (—k) < k + 0 = k 

a contradiction. So —k > 0, whence k 2 = (—k) 2 > 0. This proves the first statement. 
We now know that 1 = l 2 > 0, so for any finite n the number 

n ■ 1 = 1 4-1-1 >0 

implying that n ■ 1 f 0 and K must have characteristic 0. □ 

We quote the following properties of R. 

Lemma 23.3. K, with the usual ordering, is an ordered field. Every positive element 
ofM. has a square root in R. Every odd degree polynomial over R has a zero in R. 

These are all proved in any course in analysis, and depend on the fact that a 
polynomial function on R is continuous. 
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23.2 Sylow’s Theorem 

Next, we set up the necessary group theory. Sylow’s Theorem is based on the 
concept of a p-group: 

Definition 23.4. Let p be a prime. A finite group G is a p-group if its order is a power 
of p. 

For example, the dihedral group O 4 is a 2-group. If n > 3, then the symmetric 
group S n is never a p-group for any prime p. 

The /;-groups have many pleasant properties (and many unpleasant ones, but we 
shall not dwell on their Dark Side). One is: 

Theorem 23.5. If G f 1 is a finite p-group, then G has non-trivial centre. 

Proof. The class equation (14.2) of G reads 

p' 1 = |G| = 1 + \C 2 \ -\ -h \C r \ 

and Corollary 14.12 implies that \Cf = p n > for some ”j> o- Now p divides the right- 
hand side of the class equation, so that at least p— 1 values of \Cj\ must be equal to 
1. But if x lies in a conjugacy class with only one element, then g 1 xg = x for all 
g g G, that is, gx = xg. Hence x g Z(G). Therefore Z(G) f 1. □ 

From this we easily deduce: 

Lemma 23.6. If G is a finite p-group of order p", then G has a series of normal 
subgroups 

1 = G 0 C Gj C ... C G„ = G 
such that \Gj\ = p 1 for all j = 0 

Proof. Use induction on n. If n = 0 all is clear. If not, let Z = Z(G) / I by Theo¬ 
rem 23.5. Since Z is an abelian group of order p' n it has an element of order p. The 
cyclic subgroup K generated by such an element has order p and is normal in G since 
K C Z. Now G/K is a p-group of order p" 1 , and by induction there is a series of 
normal subgroups 

K/K = Gi/KC...CG n /K 

where \Gj/K\ = p 7 1 . But then Gj = p' and G ; <lG. If we let Go = 1, the result 
follows. □ 

Corollary 23.7. Every finite p-group is soluble. 

Proof. The quotients Gj+i/Gj of the series afforded by Lemma 23.6 are of order p, 
hence cyclic and in particular abelian. □ 
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In 1872 Sylow discovered some fundamental theorems about the existence of p- 
groups inside given finite groups. We shall need one of his results in this chapter. We 
state all of his results, though we shall prove only the one that we require, statement 

( 1 ). 

Theorem 23.8 (Sylow’s Theorem). Let G be a finite group of order p a r where p is 
prime and does not divide r. Then 

(1) G possesses at least one subgroup of order p a . 

(2) All such subgroups are conjugate in G. 

(3) Any p-subgroup of G is contained in one of order p a . 

(4) The number of subgroups of G of order p a leaves remainder 1 on division by 
P- 

This result motivates: 

Definition 23.9. If G is a finite group of order p“r where p is prime and does not 
divide r, then a Sylow p-subgroup of G is a subgroup of G of order //'. 

In this terminology Theorem 23.8 says that for finite groups Sylow p-subgroups 
exist for all primes p. are all conjugate, are the maximal p-subgroups of G, and occur 
in numbers restricted by condition (4). 

Proof of Theorem 23.8(1). Use induction on |G|. The theorem is obviously true for 
|G| = 1 or 2. Let Ci.... ,C S be the conjugacy classes of G, and let Cj = \Cj\. The class 
equation of G is 

p a r = ci + ---+c s (23.1) 

Let Zj denote the centraliser in G of some element Xj £ Cj. and let nj = \Zj\. By 
Lemma 14.11 

n j = p“ r / c j (23.2) 

Suppose first that some Cj is greater than 1 and not divisible by p. Then by (23.2) 
nj < p“r and is divisible by p". Hence by induction Z ; contains a subgroup of order 
p a . Therefore we may assume that for all j = 1..... ,v either cj = 1 or p\cj. Let 
z = |Z(G)|. As in Theorem 23.5, z is the number of values of i such that Cj = 1. So 
p a r = z + kp for some integer k. Hence p divides z, and G has a non-trivial centre 
Z such that p divides \Z\. By Lemma 14.14, the group Z has an element of order p, 
which generates a subgroup P of G of order p. Since P C Z it follows that P<i G. By 
induction G/P contains a subgroup S/P of order //' 1 , whence S is a subgroup of G 
of order p a and the theorem is proved. □ 

Example 23.10. Let G = S 4 , so that |G| = 24. According to Sylow’s theorem G must 
have subgroups of orders 3 and 8 . Subgroups of order 3 are easy to find: any 3-cycle, 
such as (123) or (134) or (234), generates such a group. We shall find a subgroup of 
order 8 . Let V be the Klein four-group, which is normal in G. Let T be any 2-cycle, 
generating a subgroup T of order 2. Then ¥n T = 1, and NT is a subgroup of order 
8 . (It is isomorphic to O 4 .) 



The Algebraic Proof 


281 


Analogues of Sylow’s theorem do not work as soon as we go beyond prime pow¬ 
ers. Exercise 23.1 illustrates this point. 


23.3 The Algebraic Proof 

With Sylow’s Theorem under our belt, all that remains is to set up a little more 
Galois-theoretic machinery. 

Lemma 23.11. Let K be a field of characteristic zero, such that for some prime p 
every finite extension MofK with Mf^K has [M : K] divisible by p. Then every finite 
extension ofK has degree a power ofp. 

Proof Let A be a finite extension of K. The characteristic is zero so N: K is separa¬ 
ble. By passing to a normal closure we may assume N: K is also normal, so that the 
Galois correspondence is bijective. Let G be the Galois group of N: K , and let P be 
a Sylow p-subgroup of G. The fixed field P' has degree [P : A] equal to the index of 
P in G (Theorem 12.2(3)), but this is prime to p. By hypothesis, l yi = K, so P = G. 
Then [N : K] = \G\ = p n for some n. □ 

Theorem 23.12. Let K be an ordered field in which every positive element has a 
square root and every odd-degree polynomial has a zero. Then K(i) is algebraically 
closed, where i 2 = — 1. 

Proof K cannot have any extensions of finite odd degree greater than 1. For suppose 
[M : K]=r > 1 where r is odd. Let a £ M\K have minimal polynomial m. Then dm 
divides r, so is odd. By hypothesis m has a zero in K, so is reducible, contradicting 
Lemma 5.6. Hence every finite extension of K has even degree over K. The charac¬ 
teristic of K is 0 by Lemma 23.2, so by Lemma 23.11 every finite extension of K has 
2 -power degree. 

Let M f K(i) be any finite extension of K(i) where i 1 = - 1. By taking a normal 
closure we may assume M : K is normal, so the Galois group of M : K is a 2-group. 
Using Lemma 23.6 and the Galois correspondence, we can find an extension N of 
K{i) of degree [N : K(i)] = 2. By the formula for solving quadratic equations, N = 
K(i)(a ) where a 2 £ K(i). But if a,b € K then recall (2.5): 


V a + bi = 


i + V a 2 + b 2 . —a + \/a 2 +b 2 


where the square root of a 2 + b 2 is the positive one, and the signs of the other two 
square roots are chosen to make their product equal to b. The square roots exist in K 
since the elements inside them are positive, as is easily checked. 

Therefore a £ K(i), so that N = K(i), which contradicts our assumption on N. 
Therefore M = K(i), and K{i ) has no finite extensions of degree > 1. Hence any 
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irreducible polynomial over K(i) has degree 1, otherwise a splitting field would have 
finite degree > 1 over K(i). Therefore K(i) is algebraically closed. □ 

Corollary 23.13 (Fundamental Theorem of Algebra). The field C of complex 
numbers is algebraically closed. 

Proof. Put R = K in Theorem 23.12 and use Lemma 23.3. □ 


EXERCISES 

23.1 Show that A 5 has no subgroup of order 15. 

23.2 Show that a subgroup or a quotient of a p-group is again a /;-group. Show that 
an extension of a /;-group by a /;-group is a p-group. 

23.3 Show that S„ has trivial centre if n > 3. 

23.4 Prove that every group of order /riwith p prime) is abelian. Hence show that 
there are exactly two non-isomorphic groups of order p 2 for any prime number 
P- 

23.5 Show that a field K is algebraically closed if and only if L.K algebraic implies 
L = K. 

23.6 Show that every algebraic extension of R is isomorphic to R : R or C : R. 

23.7 Show that C, with the traditional field operations, cannot be given the structure 
of an ordered field. If we allow different field operations, can the set C be given 
the structure of an ordered field? 

23.8 Prove the theorem whose statement is the title of Gauss’s doctoral dissertation 
mentioned at the beginning of the chapter. (‘Rational integral function’ was his 
term for ‘polynomial’.) 

23.9 Suppose that K:Q is a finitely generated extension. Prove that there exists a 
Q-monomorphism C. (Hint: Use cardinality considerations to adjoin tran¬ 
scendental elements, and algebraic closure of C to adjoin algebraic elements.) 
Is the theorem true for R rather than C? 

23.10 Mark the following true or false. 

(a) Every soluble group is a /?-group. 

(b) Every Sylow subgroup of a finite group is soluble. 

(c) Every simple p-group is abelian. 
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(d) The field A of algebraic numbers defined in Example 17.4 is algebraically 
closed. 

(e) There is no ordering on C making it into an ordered field. 

(f) Every ordered field has characteristic zero. 

(g) Every field of characteristic zero can be ordered. 

(h) In an ordered field, every square is positive. 

(i) In an ordered field, every positive element is a square. 
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Transcendental Numbers 


Our discussion of the three geometric problems of antiquity—trisecting the angle, 
duplicating the cube, and squaring the circle—left one key fact unproved. To com¬ 
plete the proof of the impossibility of squaring the circle by a ruler-and-compass 
construction, crowning three thousand years of mathematical effort, we must prove 
that n is transcendental over Q. (In this chapter the word ‘transcendental’ will be 
understood to mean transcendental over Q.) The proof we give is analytic, which 
should not really be surprising since 7Z is best defined analytically. The techniques 
involve symmetric polynomials, integration, differentiation, and some manipulation 
of inequalities, together with a healthy lack of respect for apparently complicated 
expressions. 

It is not at all obvious that transcendental real (or complex) numbers exist. That 
they do was first proved by Liouville in 1844, by considering the approximation 
of reals by rationals. It transpires that algebraic numbers cannot be approximated 
by rationals with more than a certain ‘speed’ (see Exercises 24.5-24.7). To find a 
transcendental number reduces to finding a number that can be approximated more 
rapidly than the known bound for algebraic numbers. Liouville showed that this is 
the case for the real number 

oo 

£ = £ icr" ! 

n= 1 

but no ‘naturally occurring’ number was proved transcendental until Charles Her- 
mite, in 1873, proved that e, the ‘base of natural logarithms’, is. Using similar meth¬ 
ods, Ferdinand Lindemann demonstrated the transcendence of n in 1882. 

Meanwhile Georg Cantor, in 1874, had produced a revolutionary proof of the 
existence of transcendental numbers, without actually constructing any. His proof 
(see Exercises 24.1-24.4) used set-theoretic methods, and was one of the earliest 
triumphs of Cantor’s theory of infinite cardinals. When it first appeared, the math¬ 
ematical world viewed it with great suspicion, but nowadays it scarcely raises an 
eyebrow. 

We shall prove four theorems in this chapter. In each case the proof proceeds by 
contradiction, and the final blow is dealt by the following simple result: 

Lemma 24.1. Let f : Z —> Z be a function such that f{n) —> 0 as n —> +°°. Then 
there exists N € Z such that f(n) = 0 for all n > N. 

Proof. Since f(n) —> 0 as n —> +°°, there exists N £ Z such that |/(n) — 0| < \ 
whenever n > N , for some integer W Since f(n) is an integer, this implies that /(«) = 
0 for n>N. □ 
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24.1 Irrationality 

Lindemann’s proof is ingenious and intricate. To prepare the way we first prove 
some simpler theorems of the same general type. These results are not needed for 
Lindemann’s proof, but familiarity with the ideas is. The first theorem was initially 
proved by Johann Heinrich Lambert in 1770 using continued fractions, although it is 
often credited to Legendre. 

Theorem 24.2. The real number n is irrational. 

Proof. Consider the integral 

In — J (1 —x 2 )"cos(ax)dv 

Integrating by parts, twice, and performing some fairly routine calculations, this leads 
to a recurrence relation 

a 2 I n = 2n(2n - l)/„_i -4 n(n - \)I n -i (24.1) 

if n > 2. After evaluating the cases n = 0.1, induction on n yields 

a 2n+1 I n = n\(P„sin(a) + Q„cos(a)) (24.2) 

where P„ and Q„ are polynomials in a of degree <2n+l with integer coefficients. 
The term n\ comes from the factor 2n(2n — 1) of (24.1). 

Assume, for a contradiction, that 7t is rational, so that n = a/b where a,b £ Z 
and b fO. Let a = n/2 in (24.2). Then 

J n =a 2 " +l I„/n\ 

is an integer. By the definition of 

2 n+l /* 1 ~- 

J n = —j— J (1 — x 2 )”cos — xdx 

The integrand is > 0 for —1 < x < 1, so J n >0. Hence J„ / 0 for all n. But 

i |fl| 2 " +l [' 71 

\J„\ < - / cos — xdx 

1 1 “ n\ J- 1 2 

< 2|a| 2n+1 /«! 

Hence J„ —> 0 as n —> +°q. This contradicts Lemma 24.1, so the assumption that n is 
rational is false. □ 

The next, slightly stronger, result was proved by Legendre in his Elements de 
Geometrie of 1794, which, as we remarked in the Historical Introduction, greatly 
influenced the young Galois. 
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Theorem 24.3. The real number K 1 is irrational. 

Proof. Assume if possible that n 2 = a/b where a,b gZ and b 7 ^ 0 . Define 

/(x) =x"(l —x) n /n\ 


and 

G(x) = b n ( n 2n f{x) - n 2n ~ 2 f"{x) + • • • + (-l )'^ 0 / 2 ' 0 W) 

where the superscripts on / indicate derivatives. We claim that any derivative of / 
takes integer values at 0 and 1. Recall Leibniz’s rule for differentiating a product: 

d m n,i\ d r ud m ~ r v 

dx m “\r/ dx r dx m_r 

If both factors x" or (1 — x) n are differentiated fewer than n times, then the value 
of the corresponding term is 0 whenever x = 0 or 1 . If one factor is differentiated n 
or more times, then the denominator nl is cancelled out. Hence G( 0) and G(l) are 
integers. Now 

[G'(x)sin( 7 rx)- 7 rG(x)cos( 7 rx)] = \G"[x) + n 2 G(x)] sin(TTx) 

= b n n 2n+2 f(x) sin ( nx) 

since fix) is a polynomial in x of degree 2 n, so that f( 2 " +2 '> (x) = 0. And this expres¬ 
sion is equal to 

7i 2 a n sin( 7 rx)/(x) 

Therefore 


n / a"sin(7rx)/(x)dx = 
Jo 


C'(,)sin (^) - GMcos(jj) 11 


Jo 


= G(0) + G(1) 

which is an integer. As before the integral is not zero. But 


a"sin(7rx)/(x)dx 


< \a\ n / |sin(7rx)||/(x)|dx 


/o 


nl | v .n 


< | a 

1 r l 


|x"(l-x)"| 


civ 


0 nl 


< A [ |(ax)"(l — x)"|dx 
nl Jo 


which tends to 0 as n tends to +°°. The usual contradiction completes the proof. □ 
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24.2 Transcendence of e 

We move from irrationality to the far more elusive transcendence. Hermite’s orig¬ 
inal proof was simplified by Karl Weierstrass, Hilbert, Adolf Hurwitz, and Paul Gor- 
dan, and it is the simplified proof that we give here. The same holds for the proof of 
Lindemann’s theorem in the next section. 

Theorem 24.4 (Hermite). The real number e is transcendental. 

Proof. Assume that e is not transcendental. Then 

a m e'" H-haie + ao = 0 

where without loss of generality we may suppose that aj £ Z for all j and aofO. 
Define 

„ , xr-'(x-inx-2)P...(x-in)r 

m = -(F^jT- 

where p is an arbitrary prime number. Then / is a polynomial in x of degree mp + 
p — 1. Put 

F (. x ) = f(x) + f (x) + • • • + f^P+P-^) [x) 

and note that = 0. Calculate: 

^(e-TF(x)) = e~*(F'(x) -F(x)) = -e~ x f(x) 

Hence for any j 

aj J e~ x f{x)dx = aj[—e~ x F(x)\ J 0 

= ajF(0) — ap~^F{j)x 

Multiply by e' and sum over j to get 

m / r-j \ m m 

£ ( a i &1 / e_A /W dv ) = F(0) £ aje 1 - £ aj F(j) 

7=0 V JQ / 7=0 7=0 

m mp+p-l 

= £ £ ajf®U) (24.3) 

7=0 i=0 

from the equation supposedly satisfied by e. 

We claim that each f F ( j) is an integer, and that this integer is divisible by p 
unless y' = 0 and i = p — 1. To establish the claim we use Leibniz’s rule again; the only 
non-zero terms arising when / / 0 come from the factor (x — j) p being differentiated 
exactly p times. Since pi /ip — 1)! = p, all such terms are integers divisible by p. In 
the exceptional case j = 0, the first non-zero term occurs when i p 1 , and then 


f( p ~ l \0) = (—l) p ... (— m ) p 
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Subsequent non-zero terms are all multiples of p. The value of equation (24.3) is 
therefore 

K p + ao(-\) p ...(-m) p 

for some K £ Z. If p > max(w, |ao|). then the integer ao(— l) p ... {—m) p is not divis¬ 
ible by p. So for sufficiently large primes p the value of equation (6.3) is an integer 
not divisible by p, hence not zero. 

Now we estimate the integral. If 0 < x < m then 

l/( x )l < m mp+p ~ l /{p — 1 )! 


so 


Y^aje’ / e X f(x)dx 


j =o 


< E 


fl<e- 


Wl 


rj m m P+P- 1 


J 'Jo (P-1)! 


dr 


inp+p-l 




which tends to 0 as p tends to +°°. 

This is the usual contradiction. Therefore e is transcendental. 


□ 


24.3 Transcendence of n 

The proof that n is transcendental involves the same sort of trickery as the previ¬ 
ous results, but is far more elaborate. At several points in the proof we use properties 
of symmetric polynomials from Chapter 18. 

Theorem 24.5 (Lindemann). The real number n is transcendental. 

Proof. Suppose for a contradiction that n is a zero of some non-zero polynomial 
over Q. Then so is in. where i = y/~ 1. Let 8\ (x) £ Q[x] be a polynomial with zeros 
ai = in, (X 2 , ■ ■ ■, cc„. By a famous theorem of Euler, 

e m + 1=0 


so 

(e“* + l)(e “ 2 + 1)... (e“» + 1) = 0 (24.4) 

□ 

We now construct a polynomial with integer coefficients whose zeros are the 

exponents a (1 -1 -[- a /r of e that appear in the expansion of the product in (24.4). 

For example, terms of the form 


e“' • e“' • 1 • 1 • 1 • • • 1 
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give rise to exponents a s + a t . Taken over all pairs s.t we get exponents of the form 
oq + OI 2 , ■ ■ ■ , «„ 1 + a„. The elementary symmetric polynomials of these are sym¬ 
metric in 0 £i,..., On, so by Theorem 18.10 they can be expressed as polynomials in 
the elementary symmetric polynomials of a\,...,a„. These in turn are expressible 
in terms of the coefficients of the polynomial 0] whose zeros are Hence 

the pairs a s + a t satisfy a polynomial equation 0 2 (x) = 0 where 0 ? has rational co¬ 
efficients. Similarly the sums of k of the a’s are zeros of a polynomial 0/ (x) over Q. 
Then 

01 (x) 0 2 (x)... 0 „(x) 

is a polynomial over Q whose zeros are the exponents of e in the expansion of equa¬ 
tion (24.4). Dividing by a suitable power of x and multiplying by a suitable integer we 
obtain a polynomial 0(x) over Z, whose zeros are the non-zero exponents fp,.... p, 
of e in the expansion of equation (24.4). 

Now (24.4) takes the form 

e ^ 1 H-h e fr + e° H-b e° = 0 

that is, 

e/3i+... +e fr +k = 0 (24.5) 

where k £ Z. The term 1 • 1 • • • 1 occurs in the expansion, so k > 0. 

Suppose that 

0 (x) = CX r + Cix' ~ 1 H-b c r 

We know that c r / 0 since 0 is not a zero of 0. Define 

= ^[6(x)Y 

n -> (p-1 )! 

where ,v = rp— 1 and p is any prime number. Define also 

F(x) = f(x)+f r (x) + ■ ■ • +/(W- 1)( X ) 
and note that /( s +p+' ) (x) = 0. As before 

±[e- x F(x)] = -e~*f{x) 

Hence 

c~ x F(x)-F( 0) = - [ e~ y f(y)dy 
Jo 

Putting y = Ax we get 

F(x) — e?F( 0) = —x [ exp[(l — A)x]/(Ax)dA 
Jo 

Let x range over j3i,..., j3 r and sum: by (24.5) 

I F(Pj)+kF( 0) = - £ A/ exp[(l - A)/y/(A/3;)dA 
7=1 7=1 70 


(24.6) 
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We claim that for all sufficiently large p the left-hand side of (24.6) is a non-zero 
integer. To prove the claim, observe that 

if {,) (Pj )=o 

j =i 

if 0 < t < p. Each derivative /W(j3;) with t > p has a factor p, since we must differ¬ 
entiate \0(x)Y’ at least p times to obtain a non-zero term. For any such f, 

j =i 

is a symmetric polynomial in the [f of degree < ,v. Thus by Theorem 18.10 it is a 
polynomial of degree < s in the coefficients c, /c. The factor c s in the definition of 
f(x) makes this into an integer. So for t >p 

if®(Pj)=pkt 

j =i 

for suitable k t £ Z. 

Now we look at E(0). Computations show that 

f0 (t<p- 2) 

/ (?, ( 0) = < C s C P r (t=p~ 1) 

{ Up (t > p) 

for suitable l t £ Z. Consequently the left-hand side of (24.6) is 

mp + kc s c p 


for some m £ Z. Now kf=0,c^0, and c r f 0. If we take 


p > max(fc,|c|,|c r |) 


then the left-hand side of (24.6) is an integer not divisible by p, so is non-zero. 

The last part of the proof is routine: we estimate the size of the right-hand side of 
(24.6). Now 

where 

m(j)= sup \6(XPj)\ 

0<A<1 

Therefore 



I A/ /‘exp[(l-A)/3 ; -]/(A^)dA 

7=1 70 


< y mKiP 

~h (p-'y- 


where 

max / exp[(l — A)/)/]dA 
j J o 

Thus the expression tends to 0 as p tends to +°°. By the standard contradiction, K is 
transcendental. 
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EXERCISES 

The first four exercises outline Cantor’s proof of the existence of transcendental 
numbers, using what are now standard results on infinite cardinals. 

24.1 Prove that R is uncountable, that is, there is no bijection Z —> R. 

24.2 Define the height of a polynomial 

f {t -) — ao -1-b a fit (E Z [f ] 

to be 

h(f) =n+ |a 0 | 4- \-\a n \ 

Prove that there is only a finite number of polynomials over Z of given height 
h. 

24.3 Show that any algebraic number satisfies a polynomial equation over Z. Using 
Exercise 24.2 show that the algebraic numbers form a countable set. 

24.4 Combine Exercises 24.1 and 24.3 to show that transcendental numbers exist. 

The next three exercises give Liouville’s proof of the existence of transcenden¬ 
tal numbers. 

24.5* Suppose that x is irrational and that 

f(x) = a n x" H-f ao = 0 

where ao,...,a n € Z. Show that if p,q &Z and 0, and f{p/q) ^=0, then 

\f{p/q)\ > 1 /q n 

24.6* Now suppose that x— 1 < p/q < x + 1 and p/q is nearer to x than any other 
zero of /. There exists M such that \f'(y) \ < M if x — 1 < y < x + 1. Use the 
mean value theorem to show that 

\p/q — x| > M~ l q~ n 

Hence show that for any r > n and K > 0 there exist only finitely many p and 
q such that 

\p/q-x\ <Kq-' 

24.7 Use this result to prove that £" =1 10 " ! is transcendental. 

24.8 Prove that z C C is transcendental if and only if its real part is transcendental 
or its imaginary part is transcendental. 

24.9 Mark the following true or false. 
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(a) n is irrational. 

(b) All irrational numbers are transcendental. 

(c) Any nonzero rational multiple of n is transcendental. 

(d) n + is/5 is transcendental. 

(e) e is irrational. 

(f) If a and p are real and transcendental then so is Ot -\ p. 

(g) If a and p are real and transcendental then so is a + ip. 

(h) Transcendental numbers form a subring of C. 

(i) The field Q(7T) is isomorphic to Q(?) for any indeterminate 1. 

(j) Q(7t) and Q(e) are non-isomorphic fields. 

(k) Q(tt) is isomorphic to Q(n 2 ). 
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Chapter 25 


What Did Galois Do or Know? 


This is not a scholarly book on the history of mathematics, but it does contain a 
substantial amount of historical material, intended to locate the topic in its context 
and to motivate Galois theory as currently taught at undergraduate level. (At the 
research frontiers, the entire subject is even more general and more abstract.) 

There is a danger in this approach: it can mix up history as it actually happened 
with how we reformulate the ideas now. This can easily be misinterpreted, distort¬ 
ing our view of the past and propagating historical myths. Peter Neumann makes 
this point very effectively in his admirable English translation of Galois’s writings, 
Neumann (2011). The book covers both Galois’s published papers and those of his 
unpublished manuscripts that have survived—very few, even when brief scraps are 
included. 

To set the record straight, we now take a look at what this material tells us about 
what Galois actually did, what he knew, and what he might have been able to prove. 
Placing the material at the end of this book allows us to refer back to all of the 
historical and mathematical material. 

The folklore story is: Galois proved that A 5 is simple, indeed, the smallest simple 
group other than cyclic groups of prime order. From this he deduced that the quintic 
is not soluble by radicals. However, as Neumann states, the first statement is claimed 
without proof (and it is questionable whether Galois possessed one), while the link 
to the second does not appear explicitly anywhere in the extant manuscripts. The 
central issue, and our main focus here, is the relation between solving the quintic 
by radicals and the alternating group A5. It would be easy to imagine, and has often 
been asserted, that Galois viewed these topics in the same way as they have been 
presented in earlier chapters, and that in particular that the key issue, for him, was to 
prove that A 5 is simple. 

Not so. 

However, history is seldom straightforward, especially when sources are frag¬ 
mentary and limited. Closely related statements do appear, enough to justify Galois’s 
stellar reputation among mathematicians and to credit him with the most penetrating 
insights of his period into the solution of equations by radicals and its relation to 
groups of permutations. As Neumann writes: ‘The [First] memoir on the conditions 
for solubility of equations by radicals is undoubtedly Galois’s most important work. 
It is here that he presented his original approach to the theory of equations that has 
now become known as Galois Theory.’ 
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What Did Galois Do or Know? 


25.1 List of the Relevant Material 

Galois’s published papers are five in number, and only one, ‘Analysis of a mem¬ 
oir on the algebraic solution of equations’, is relevant here. After Galois died, his 
manuscripts went to a literary executor, his friend Auguste Chevalier. Chevalier 
passed them on to Liouville, who brought Galois’s work to the attention of the math¬ 
ematical community, probably encouraged by the brother, Alfred Galois. Liouville’s 
daughter Mme de Blignieres gave them to the French Academy of Sciences in 1905 
or 1906, where they were organised into 25 ‘dossiers’ and bound into a single vol¬ 
ume. Parts were published or analysed by Chevalier, Liouville, Jules Tannery, and 
Emile Picard. Bourgne and Azra (1962) published a complete edition. The first and 
currently the only complete English translation is Neumann (2011). This also con¬ 
tains a printed version of the French originals, in parallel with the translation for ease 
of comparison. Scans of the manuscripts are available on the internet at 

www.bibliotheque-institutdefrance.fr/numerisation/ 

The documents referred to below (the dossier numbers are those assigned by the 
Academy) are: 

Analysis of a memoir on the algebraic solution of equations. Bulletin des Sci¬ 
ences Mathematiques, Physiques et Chimiques 13 (April 1830) 271-272. 

Testamentary Letter, 29 May 1832, to Chevalier. 

First Memoir, sent to the Academy. 

Second Memoir, sent to the Academy. 

Dossier 8: Torn fragment related to the First Memoir. 

Dossier 10: Publication project and note on Abel. 

Dossier 15: Fragments on permutations and equations. 

Several other documents refer to groups and algebraic equations, and there are 
some on other topics altogether. 


25.2 The First Memoir 

The document called the First Memoir is the one that Galois sent to the Academy 
on 17 January 1831; it is actually his third submission, the other two having been 
lost. In the opening paragraph to the First Memoir, which functions as an abstract of 
the contents, Galois states that he will present 
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... a general condition satisfied by every equation that is soluble by rad¬ 
icals ■, and which conversely ensures their solubility. An application is 
made just to equations of which the degree is a prime number. Here is 
the theorem given by our analysis: 

In order that an equation of prime degree ... be soluble by radicals, it 
is necessary and it is sufficient that all the roots be rational functions of 
any two of them. 

He adds that his theory has other applications, but ‘we reserve them for another 
occasion.’ 

In this abstract, there is no mention of the quintic as such, although its degree 
5 is prime, so his main theorem obviously applies to it. It is not mentioned in the 
rest of the paper either. There is also no mention of the concept of a group. It is 
hard not to have some sympathy for Poisson and Lacroix, the referees: it looks like 
they did a professional job, and spotted a key weakness in the theorem upon which 
Galois places so much emphasis. (Admittedly, this is not difficult.) Namely: although 
Galois’s condition ‘all the roots be rational functions of any two of them’ is indeed 
necessary and sufficient for solubility by radicals, it is hard to think of any practical 
way to verify it for any specific equation. 

The Historical Introduction mentioned the referees’ statement that ‘one could not 
derive from [Galois’s condition] any good way of deciding whether a given equation 
of prime degree is soluble or not by radicals,’ and the remark by Tignol (1988) that 
Galois’s memoir ‘did not yield any workable criterion to determine whether an equa¬ 
tion is solvable by radicals.’ I also wrote: ‘What the referees wanted was some kind 
of condition on the coefficients that determined solubility; what Galois gave them 
was a condition on the roots’ But I think that a stronger criticism is in order: appar¬ 
ently, there is no algorithmic procedure to check whether the condition on the roots 
is valid. Or to prove that it is not. How, for example, would we use it to prove the 
quintic insoluble? 

It turns out that this judgement is not entirely correct, but further work is needed 
to see why. It is implicit in a table that Galois includes titled ‘Example of Theorem 
VII’, and I’ll come back to that shortly. But he does not make the connection explicit. 


25.3 What Galois Proved 

Before discussing possible reasons for the (to our eyes) curious omission of the 
application to quintics, we review the results that Galois does include in the First 
Memoir. These alone would establish his reputation. 

The work is short, succinct, and clearly written. A modern reader will have no 
difficulty in following the reasoning, once they get used to the terminology. He de¬ 
velops several key ideas needed to prove his necessary and sufficient condition for 
solubility by radicals, which we now recognise as the core concepts of Galois The- 
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ory. It is clear that Galois recognised the importance of these ideas, but, once again, 
he does not say so in the paper. 

After a few preliminaries, which would have been familiar to anyone working in 
the area, Galois presents his first key theorem: 

Proposition 25.1. Let an equation be given of which the m roots are a.h.c, _ There 

will always be a group of permutations of the letters a,b,c,... which will enjoy the 
following property: 

That every function of the roots invariant [a footnote explains this term] under 
the substitutions of this group will be rationally known; 

Conversely, that every function of the roots that is rationally determinable will 
be invariant under the substitutions. 

This is his definition of what we now call the Galois group. It also makes the 
central point about the Galois correspondence, expressed in terms of the roots rather 
than the modern interpretation in terms of the subfield they generate. 

Next, he studies how the group can be decomposed by adjoining the roots of 
auxiliary equations; that is, extending the field. He deduces that when a pth root 
is extracted, for (without loss of generality) prime p , the group must have what we 
would now express as a normal subgroup of index p. This leads to the next big result, 
initially posed as a question: 

Proposition 25.2. Under what circumstances is an equation soluble by radicals ? 

Galois writes ‘... to solve an equation it is necessary to reduce its group suc¬ 
cessively to the point where it does not contain more than a single permutation.’ He 
analyses what happens when the reduction is performed by adjoining ‘radical quan¬ 
tities’. He concludes, slightly obscurely, that the group of the equation must have a 
normal subgroup of prime index, which in turn has a normal subgroup of prime in¬ 
dex, and so on, until we reach the group with a single element. In short: the equation 
is soluble by radicals if and only if its group is soluble. But he fails to state this as an 
explicit proposition. 

Galois goes on to illustrate the result for the general quartic equation, obtaining 
essentially what we found in Section 18.5 of Chapter 18. This of course was a known 
result, and Lagrange had already related it to permutation groups in his Traite de la 
Resolution des Equations Numeriques de Tous les Degres. But instead of continu¬ 
ing to the quintic, and proving that the group is not soluble, Galois does something 
that is in some ways more interesting, but answers another (closely related) question 
instead: 

Proposition 25.3. What is the group of an equation of prime degree n that is soluble 
by radicals? 

His answer is that if the roots are suitably numbered, the group of the equation 
can contain only substitutions of the form 


Xk 1 y Xak+b 


( 25 . 1 ) 
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where the roots are the Xk, the symbols a.h denote constants, and ak + b is to be 
computed modulo n. 

To modern eyes, what he should have remarked at this point is that when n = 5 
the group of all such substitutions has 4.5 = 20 elements (we need 0 ^ a G Z 5 and 
b £ Z 5 ), so it cannot equal S 5 , the group of the general quintic. Moreover, Galois 
definitely knew that for any m the group of the general equation of degree m is the 
symmetric group § m . He states as much in the discussion of his Proposition I: 

In the case of algebraic equations, this group is nothing other than the 
collection of the 1.2.3 ...m possible permutations on the m letters, be¬ 
cause in this case, only the symmetric functions are rationally deter¬ 
minable. 

By ‘algebraic equation’ he meant what we now call the ‘general polynomial equa¬ 
tion’. Galois distinguished ‘numerical’ and ‘literal’ equations: those in which the co¬ 
efficients are specific numbers, and those in which they are arbitrary symbols. He 
is clearly thinking of literal equations here. But to a casual reader this statement is 
somewhat confusing. 

Anyway, Galois does no such thing. Instead, he in effect observes that once you 
have two numbers of the form ak + b,a'k + b 1 , you can generate all numbers of this 
form. Whence the criterion that given any two roots, the others are all rationally 
expressible. 


25.4 What is Galois Up To? 

Taking inspiration and historical information from Neumann (2011), 1 now think 
there is a sensible explanation of what at first sight seems to be a strange series of 
omissions and obscurities, in which Galois wanders all round a key idea without ever 
putting his finger on it. Namely: Galois wasn’t interested in discussing the quintic. 
He was after something quite different. 

We know that he had taken on board the work of Ruffini and Abel, because 
Dossier 10 refers to Abel’s proof that the quintic is insoluble, and Dossier 8 states: 

It is today a commonly known truth that general equations of degree 
greater than the 4 th cannot be solved by radicals. 

This truth has become commonly known to some extent by hearsay 
and even though most geometers do not know the proofs of it given 
by Ruffini, Abel, etc., proofs founded upon the fact that such a solution 
is already impossible for the fifth degree. 

This being so, why should Galois place any emphasis on the quintic? I think 
he had his sights set on something more ambitious: to say something new about 
solutions by radicals. 
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The first piece of evidence is the continuation of the above quotation: ‘In the first 
instance it would seem that the [theory] of solution of equations by radicals would 
end there.’ Unfortunately the text on that side of the paper ends at this point, and the 
other side merely lists titles of four memoirs. 

Another is Dossier 9, which includes: 

The proposed goal is to determine the characteristics for the solubility of 
equations by radicals... that is the question to which we offer a complete 
solution. 

He then acknowledges that in practice ‘the calculations are impracticable,’ but 
attempts to justify the importance of the result nonetheless: 

... most of the time in algebraic analysis one is led to equations all of 
whose properties one knows beforehand: properties by means of which it 
will always be easy to answer the question by the rules we shall expound 
... I will cite, for example, the equations which give the division of 
elliptic functions and which the celebrated Abel has solved ... 

Galois refers to these ‘modular equations’ from the theory of elliptic functions 
elsewhere, and they presumably played a major role in his thinking. 

Dossier 10 states: 

... Abel did not know the particular circumstances of solution by rad¬ 
icals ... he has left nothing on the general discussion of the problem 
which has occupied us. Once and for all, what is remarkable in our the¬ 
ory [is to be able to answer yes or no in all cases, crossed out\. 

Over and over again Galois places emphasis not on proving equations such as 
the general quintic insoluble, but on finding equations that are soluble. The title of 
the First Memoir says it all: ‘Memoir on the conditions for solubility of equations 
by radicals.’ So does that of the Second Memoir: ‘On primitive equations which are 
soluble by radicals.’ Galois is not interested in impossibility proofs. To him, they are 
old hat; they do not lead anywhere new. This, I suspect, is why he does not use the 
quintic as an example in the First Memoir; it is most definitely why his main general 
result is Proposition VII. In modern terms, he is telling us that an equation is soluble 
by radicals if and only if its Galois group is conjugate to a subgroup of the affine 
general linear group A<GL(l,n), which consists of the transformations (25.1). These 
are the equations that Galois considers interesting; this is the theorem of which he is 
justly proud, since it constitutes a major advance and characterises soluble equations. 

It is also worth remarking that the form in which Galois states Proposition VII 
does not involve the notion of a group. It would be immediately comprehensible 
to any algebraist of the period, without having to explain to them the new—and 
rather unorthodox—concept of a group. This is reminiscent of the way that Isaac 
Newton used classical geometry rather than calculus to prove many statements in 
his Principia Mathematica , even though he probably used calculus to derive them in 
the first place. Ironically, by trying—for once—to make his ideas more accessible, 
Galois obscured their importance. 
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25.5 Alternating Groups, Especially A 5 

Neumann (2011) discusses several myths about Galois. Prominent among them 
is the claim that he proved the alternating group A„ is simple when n> 5. However, 
these groups are not mentioned in any of the works of Galois published by Liou- 
ville in 1846, which was the main source for professional mathematicians. There is 
no mention even of A 5 , and even the symmetric groups are mentioned only to illus¬ 
trate Proposition I of the First Memoir (see the quotation in Section 25.3) and as an 
example for Proposition V when the degree is 4. 

One reason why Galois did not mention the simplicity of A„ or even of A 5 is 
that he didn’t need it. His necessary and sufficient condition for solubility—having 
a group conjugate to a subgroup of AGL(l,n) —was all he needed. We can prove 
that A 5 cannot occur rather easily: its order is 60 while that of AGL( 1,5) is only 20. 
Simplicity is not the issue. However, Galois doesn’t even say that: insolubility is also 
not the issue, for him. 

But... 

As Neumann recognises, Galois does give brief mention to alternating groups in 
a few manuscripts. One is Dossier 15, which consists of a series of short headings. 
It looks suspiciously like the outline of a lecture course. Could it be the one on 
advanced algebra that he offered on 13 January 1831 ? It might be a plan for a memoir, 
or even a book, for all we know. Crossed out, we find the words: 

Example. Alternate groups (Two similar groups). Properties of the alternate 
groups. 

By ‘two similar groups’ Galois is referring to two cosets with the same structure: 
this was his way to say ‘normal subgroup of index 2’, no doubt in §„. The same text 
appears slightly later, also crossed out. Later still we find ‘New proof of the theorem 
relative to the alternate groups’, not crossed out. This is followed shortly by ‘One 
may suppose that the group contains only even substitutions’, which I take to be a 
‘without loss of generality we may assume the group is contained in the alternating 
group’. 

There is a simple way to set this up, which was known to every algebraist, and 
Galois would have learned it at his mother’s knee. It uses the quantity 8 defined 
in (1.13). This changes sign if any two roots are interchanged; that is, it is invariant 
under A„ but not S„. However, its square A = 8 2 is a symmetric function of the roots 
and therefore can be expressed as a function of the coefficients. It is the discriminant 
of the equation, so named because its traditional role is to provide a computable 
algebraic test for the existence of a multiple root. Indeed, A = 0 if and only if the 
equation has a multiple root. 

Since A is a rational function of the coefficients, we can adjoin 8 by taking a 
square root. As far as solving equations by radicals goes, this is harmless, and it 
reduces the group to its intersection with A„. Probably Galois had something like 
this in mind. 



302 


What Did Galois Do or Know? 


The same document includes a reference to Cauchy’s work on permutations, in¬ 
cluding 

Theorem. If a function on m indeterminates is given by an equation of 
degree m all of whose coefficients [are symmetric functions, permanent 
or alternating, of these indeterminates], this function will be symmet¬ 
ric, permanent or alternating, with respect to all letters or at least with 
respect to m — 1 among them. 

Theorem. No algebraic equation of degree higher than 4 may be solved 
or reduced. 

So there is no doubt that Galois was aware of the link between S 5 , A 5 , and the quintic. 


25.6 Simple Groups Known to Galois 

What about simple groups? Neumann points out that Galois definitely knew 
about simple groups (his term is ‘indecomposable’). But the examples he cites are 
the projective special linear groups PSL(2,p) for prime p. His Second Memoir was 
clearly heading in that direction, and this fact is stated explicitly in the letter to Cheva¬ 
lier: ‘[this group] is not further decomposable unless p = 2 or p = 3.’ 

This bring us to another statement in the letter to Chevalier, which Neumann 
reasonably considers a ‘mysterious assertion’. Namely: 

The smallest number of permutations which can have an indecompos¬ 
able [simple] group, when this number is not prime, is 5.4.3. 

That is, the smallest order for a simple group is 60. Neumann argues persuasively 
that Galois was thinking of PSL(2,5), not A 5 . Agreed, these groups are isomorphic, 
but Galois writes extensively about what we now call PSL(2,p), and says virtually 
nothing about A„. 

Neumann also provides a fascinating discussion of whether Galois actually pos¬ 
sessed a proof that the smallest order for a simple group is 60. 

He was so insightful that, perhaps, yes, he could have known it. Never¬ 
theless, I very much doubt it. How could he have excluded orders such 
as 30, 32, 36, 40, 48, 56? With Sylow’s theorems and some calculation, 
such orders can be excluded... but... it seems unlikely that Galois had 
Sylow’s theorems available to him. Besides, there is no hint in any of 
the extant manuscripts and scraps of the kind of case-by-case analysis 
that is needed... 

It is of course conceivable that Galois knew the results we now call Sylow’s Theo¬ 
rem. He was very clever, and his known insights into group theory are impressive. 
However, even granting that, the viewpoint needed to prove Sylow’s Theorem seems 
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too sophisticated for the period. The biggest problem is that it is difficult to imagine 
him failing to tell anyone about such discoveries, and some hint ought to have sur¬ 
vived among his papers. In their absence, Neumann’s last point is especially telling. 
On the other hand, and grasping at straws, Galois’s affairs were somewhat chaotic. 
Like most mathematicians, he probably threw a lot of scraps away, especially ‘rough 
work’. In the Historical Introduction we saw that when at school he did a lot of work 
in his head, instead of on paper—and was criticised for it. So the absence of evidence 
is not evidence of absence. 


25.7 Speculations about Proofs 

It is worth examining just what a mathematician of the period would have needed 
to prove Galois’s statement about the smallest order for a simple group. What follows 
illustrates what might have been possible given a little ingenuity. We use only a few 
basic theorems in group theory, all of which have easy proofs, well within Galois’s 
capabilities. We also make no claim that he was aware of any of this material. 

He knew about subgroups, cosets, conjugacy, and normal subgroups. He read 
Lagrange and must have known Lagrange’s theorem: the order of a subgroup (or 
element) divides the order of the group. 

He could have defined the normaliser Ng(H) of a subgroup of G, which is the 
set of all g £ G such that g~ l Hg = H. This is obviously a subgroup, and // <iN(fH). 
Moreover, it is evident that the number of distinct conjugates of H is equal to the 
index | G : Nq(H) |. The index of a subgroup K C G, usually denoted |G : H\, is equal 
to |G|/|//| for finite groups, and is the number of distinct cosets (left or right) of H 
in G. Galois knew about cosets (though he called them ‘groups’.) 

Galois would also have been aware of what we now call the centraliser Cc{g) of 
an element g £ G: the set of all h £ G such that h 1 gh = g. This too is a subgroup, 
and the number of distinct conjugates of g is equal to the index |G : Q; (7/) |. This line 
of thinking leads inevitably to the class equation discussed in Chapter 14 (14.2). We 
rewrite it in the form: 

|G| = 1+£|G:Cg(s/)| (25.2) 

gi 

where {#,•} is a set of representatives of the non-identity conjugacy classes of G. The 
extra l takes care of the identity. As we will see, the class equation is a surprisingly 
powerful tool when investigating simple groups of small order. 

Indeed, using the class equation, Galois would easily have been able to prove 
Theorem 14.15, published in 1845 by Cauchy. This is a limited converse to La¬ 
grange’s theorem: if a prime number p divides the order of a finite group, the group 
has an element of order p. The class equation is the key to the proof, as we saw in 
Chapter 14. 

It turns out that for putative simple groups of small order, Cauchy’s Theorem 
works fairly well as a substitute for Sylow’s theorem(s). Some systematic counting 
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of elements then goes a long way. However, it is a bit of a scramble. The main results 
we need are: 

Lemma 25.4. Let G be a non-cyclic finite simple group. Then: 

(1) The normaliser of any proper subgroup of G is a proper subgroup. 

(2) The centraliser of any element ofG is a proper subgroup ofG. 

(3) No prime p can divide the indices of all proper subgroups ofG. 

(4) There cannot exist a unique proper subgroup ofG of given order k > 1. 

Proof. (1) If not, the subgroup is normal. 

(2) If not, the element generates a cyclic normal subgroup. 

(3) If such a p exists, the class equation takes the form 


1 + Ci H-+ Ck — |G| 

where the Cj are the indices of centralisers of non-identity elements, which by ( 2 ) 
are proper subgroups. Therefore p\Cj for all j. Also p divides |G| since p divides c i, 
which divides |G|. So the class equation taken (mod p) implies that 1=0 (mod p), 
a contradiction. 

(4) Suppose that H is the unique subgroup of order k. The order of any conjugate 
g fig is also k. so g 1 II g = H for all g £ G. Therefore H<\G, a contradiction. □ 

We need one further idea. Galois’s definition of ‘normal’ immediately implies 
that a subgroup of index 2 is normal. More generally, a little thought about the con¬ 
jugates of a subgroup leads to a useful generalisation: 

Lemma 25.5. Let G be a finite group and let H be a non-normal subgroup of index 
m. Then G has a proper normal subgroup of index dividing ml In particular, G cannot 
be simple if\G\ > ml 

Proof. The subgroup H has m conjugates //, = g- 1 Hg, for 1 < i < m. For any g £ G 
the conjugate g 1 llg is one of the The map 0 : G —>■ S m defined by 0(g) = g, is a 
homomorphism. Its kernel K is a normal subgroup of G of index at most |S,„| = ml. 
If k £ K then k~ l Hk = H, so K C Ng(H) 7 ^ G, and K is proper. □ 

Armed with these weapons, Galois would easily have been able to prove: 

Theorem 25.6. Let p.q be distinct primes and k > 2. A finite non-cyclic simple group 
cannot have order p k , pq, 2 p k , 3 p k , 4 p k , or 4pfor p>l. 

Proof. (1) Order p k is ruled out by Lemma 25.4, since p divides the index of any 
proper subgroup. This is how we proved Theorem 23.5, but there we obtained a 
further consequence: the group has non-trivial centre. 

(2) Suppose G is simple of order pq. By Cauchy’s Theorem it has subgroups H of 
order p and K of order q. All nontrivial proper subgroups have order p or q. Each of 
//. K must equal its normaliser, otherwise it would be a normal subgroup. Therefore 
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H has q conjugates, which intersect pairwise in the identity, and K has p conjugates, 
which intersect pairwise in the identity. Therefore G has 1 element of order 1, at least 
(p— 1 )q elements of order p, and at least p(q— 1) elements of order q. These total 
2 pq — p — q+l = pq + (p — l)(q — l) elements, a contradiction since p,q > 1 . 

(3) Suppose G is simple of order 2 p k . There is no subgroup of index 2, so every 
proper subgroup has index divisible by p, contrary to Lemma 25.4(3). 

(4) Suppose G is simple of order 3 p k . Since 3 p k > 8, Lemma 25.5 implies that there 
is no subgroup of index < 3. Therefore every proper subgroup has index divisible by 
p, contrary to Lemma 25.4(3). 

(5) Suppose G is simple of order 4 p k . If p = 2 apply part (1). Otherwise 4 p k > 36. 
By Lemma 25.5 there is no subgroup of index < 4, so every proper subgroup has 
index divisible by p, contrary to Lemma 25.4(3). 

( 6 ) Suppose G is simple of order 4 p. Since p > 7 we have |G| > 24, so by 

Lemma 25.5 there is no proper subgroup of index < 4. In particular there is no sub¬ 
group of order p, contrary to Cauchy’s Theorem. □ 

We now present a proof, using nothing that could not easily have been known to 
Galois, of his mysterious statement: 

Theorem 25.7. There is no non-cyclic simple group of order less than 60. 

Proof Let G be a non-cyclic simple group of order less than 60. This rules out 
groups of prime order, and Theorem 25.6 rules out many other orders. Only six orders 
survive: 

20 30 40 42 45 56 
and we dispose of these in turn. 

Throughout, we apply Lemma 25.4(1, 2) without further comment. 

Order 20 

By Lemma 25.5 G has no subgroups of index < 3. Therefore the possible orders 
of nontrivial proper subgroups are 2, 4, 5 only. By Cauchy’s Theorem there exist 
elements of orders 2 and 5. 

The class equation does not lead directly to a contradiction, so we argue as fol¬ 
lows. Let N be the normaliser of any order-5 subgroup H. This is a proper subgroup. 
Since all proper subgroups have order 1, 2, 4, or 5, we have |iV| = 5. Therefore // 
has 20/5 = 4 distinct conjugates. Since 5 is prime, these conjugates intersect only in 
the identity. Each non-identity element of Z 5 has order 5, so there are 4 elements of 
order 5 in each order-5 subgroup. Therefore together these conjugates contain 4.4 = 
16 elements of order 5. 

There is also at least one element of order 2. Its normaliser has order 2 or 4, so 
cannot contain an element of order 5. It therefore has 5 distinct conjugates by any 
order-5 element. Therefore G has at least 1+16+5 = 22 elements, contradiction. 
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Order 30 

Since 30 > 4!, Lemma 25.5 implies that G has no subgroups of index < 4. 
Therefore the possible orders of nontrivial proper subgroups are 2, 3, 5, 6 only. By 
Cauchy’s Theorem there exist elements of orders 2, 3, and 5. 

The class equation can be used here, but there is a simpler argument. The nor- 
maliser of any Z 5 subgroup has order 5, hence index 6 . Thus there are at least 
6.4 = 24 elements of order 5. The normaliser of any Z 3 subgroup has order 3 or 
6 , hence index 10 or 5. Thus there are at least 5.2 = 10 elements of order 3. But 
24 + 10 = 34 > 30, a contradiction. 

Order 40 

Lemma 25.5 implies that G has no subgroups of index < 4. Therefore the possible 
orders of nontrivial proper subgroups are 2, 4, 5, 8 only. By Cauchy’s Theorem there 
exist elements of orders 2 and 5. 

The normaliser of any Z 5 subgroup has order 5, hence index 8 . Thus there are at 
least 8.4 = 32 elements of order 5. Each has centraliser of order 5, so its conjugacy 
class has 8 elements. Any further order-5 element gives rise to 32 more elements for 
the same reason, not conjugate to the above, which is impossible. So we have found 
all order-5 elements and their conjugacy classes. 

The centraliser of any element of order 2 k has order 2, 4, or 8 , hence index 20, 
10, or 5. 

The class equation therefore becomes 

40= 1+32 +5a+10/? +20c 
so 

7 = 5 a+ 10/? +20c 

which is impossible since 5 \1. 

Order 42 

Lemma 25.5 implies that G has no subgroups of index < 4. Therefore the possible 
orders of nontrivial proper subgroups are 2, 3, 6 , 7 only. Their indices are 21, 14, 7, 
and 6 . The class equation takes the form 

42= l+6a + 7/?+14c + 21r/ 

where a arises from elements of order 7. Consider this (mod 7) to deduce that 
a=l (mod 7). If a = 1 then there is a unique Z 7 subgroup. But this contradicts 
Lemma 25.4(4). Otherwise a > 8 , which yields at least 6.8 = 48 elements of order 7, 
contradiction. 

Order 45 

Lemma 25.5 implies that G has no subgroups of index < 4. Therefore the possible 
orders of nontrivial proper subgroups are 3, 5, 9 only. Their indices are 15, 9, and 5. 
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The centraliser of any order-5 element has order 5, index 9. So there are at least 
9.4 = 36 elements of order 5. 

The centraliser of any order-3 element has order 3 or 9, index 15 or 5. So there 
are at least 2.5 = 10 elements of order 3, giving at least 36 + 10 = 46 elements, 
contradiction. 

Order 56 

Lemma 25.5 implies that G has no subgroups of index < 4. Therefore the possible 
orders of nontrivial proper subgroups are 2, 4, 7, 8 only. Their indices are 28, 14, 8 , 
and 7. 

The normaliser of any Z 7 subgroup has order 7, index 8 , yielding at least 6.8 = 48 
elements of order 7. 

The normaliser of any Z 2 subgroup has order 2, 4, or 8 , index 28, 14, or 7, yield¬ 
ing at least 7 elements of order 2. 

Together with the identity, these give all 56 elements. Therefore there are exactly 
48 order-7 elements and 7 order-2 elements. 

The centraliser of any order-7 element must have order 7, index 8 . So there are 6 
conjugacy classes of order-7 elements. 

The centraliser of any order-2 element must have order 2, 4, or 8 , index 28, 14, 
or 7. 

The class equation takes the form 

56= 1 + 48 + 7 a + 14b + 28c 

soa = 1, Z? = c = 0 and there are precisely 7 order-2 elements, all conjugate to each 
other. Their centralisers have order 8 , so do not contain any order-7 element; there¬ 
fore each has the same centraliser. This is the unique order -8 subgroup, contradicting 
Lemma 25.4(4). □ 

Galois would have had little difficulty with these orders. If he needed scrap paper 
calculations, they would have been short, and easily lost or thrown away. However, 
history relies on written evidence, and there is no documentary evidence that Galois 
ever proved Theorem 25.7. However, the above proof makes it plausible that Galois 
could have known how to prove that the smallest non-cyclic simple group has order 
60. 


EXERCISES 

25.1 Prove, using the methods of this chapter, that a simple group cannot have order 
5 p k where k > 2 and p> 5 is prime. 


25.2 Using the methods of this chapter, extend the list of impossible orders for non- 
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cyclic simple groups from 61 upwards, as far as you can using the methods of 
this chapter. 

(Using more advanced methods it can be proved that the next possible order is 
168, so there are plenty of orders to try. Orders 72,80,84,90 seem to require 
new ideas and may be beyond the methods of this chapter.) 
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